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(54) [5m<OZ,m A>f>?>DNA, *<0^>/^K2fcrWE-©JBifc 



(57) ta*ij] 



1 

- FT£DNAie^J£*l>ri,>&lf*^2CtiafE©& 

Bft. 

[«*ji 4 ] mxm i cctBis©&®xw:4>& < t i 
^ > ^ > ©#& gifts? £ fc#> ©&® ^ p - ^. 

81. Xtt4>&< i & 1 5<I©^©$flg2*U:*-?- F£W 

gB^JXte^fc < i i> 5 ffl©-£©#?Jg7 
[fft#JI7] ^>^>*>£>A?«£;3-F-f£$s 

[s»#jf8] gt#Ji7fciaiS©&i§£^ ; rr£'<2'# 

o 

[twos 9 1 n&m & tazMo^ xmm 

[f**Ja 1 0 ] IE?"J## 6 K*t7 5 vBMBW*Wr 
[f**«l 1 ] ^>*>^>£>A£J(Xttd>Sc< i 

cnrav i 2 ] iB^js-^ e (c^t? 5 y mmnzm-r 

[fS#« 1 3 ] i»*JI 1 1 5ci5iS© £ tmXiZ'P 
tt< ifc 5{1©-?*©E8J£7 5 ^fS^a— FT&RNA6B 

1 4 ] ^>^>^>ta<*„ 
[ft*JH 1 5 ] 4CCfBtS©fa<*£g£-rS^ 

i 6 ] ( nim«» >?•> h >gst#i©^ 

■c. 

(b) ISKf4*©^>^>^>(CAG)n ««*tntar 
SCifc4;or. fgfS*4*©^>?> ?>&&©#£££ 

(c) Ig(b)©#tt£, ^>^> h>ft^p!©St>©& 
I ><H&#> P. 0 fc|B18l©#tFHiI i Jttt b . 

(d) Ctx6©>' , '>9 1 >?>CCAG)n ffi^©#t4^ 



l 

(2) 9$M¥7 -67 66 1 

2 

[ffiWas l 7 ] ^>?-> h >*HSft&&© : ft%£KiB 

(b) fg;^*:A>-f^cti©^>^>^>(CAG)n 
MtiS^^-r £ m > 9 1 >&&<D&&* If If" > 7" n 

v v a#TXU# y y 9 - -fe'lftSfc^Sf «: <£ o rffffi o . 

io i 8 ] i tctai£©^^ ; 5r.«^©Mflacc 
ti«*Ji 1 9 ] t»#ii i 3 ic%&m>mmft*z&m<D 

Si L-C^Z&%<D^Zs?> h ^^^MS-TS^ 

[ s»#ji 2 o ] if am i i (cs ais© ^ > f«K?fr 
-rs. -e©ck i 5^Mg^siL,Ti^.m#©^>g L > 
i ] st*^i i ccsbis© * i'SKjg^ 

[!*#Ji2 2 ] lS{b^* s *>>'^«'C*2>f»*92 

l cciBtg©^ri„ 

coooi] *#iBj©Bi%©i^tc?f t>titcwm*%mm 

30 ^©ttffJ^WUTC^o 

*aiEllil 9 9 3^3 ^5 BKffiBSSnfc^HlfitaJSS 
M08/0 27. 4 9 8-^©— gPiffi^WBS-C*S (C© 

HJBlrt«B9lffl«:«t- 3 r*IHIi»{ctt*S4i&) . 
[0 00 2] %BJ©^i? 

*^Bj«3t{SH©^aso'Mg©^if(cp-r.5„ f*Mfc 
«. *^Bj«^>^>^>ae^ ( i t 1 5ae^-i4> 

40 ©^©j^^K^^ai-r-s/cfe. (2) ^ > ?->i-> 
»^©^»t©/c*. o) ^>?-> h>^)w©Ma 

©/c*. &*>*(4) c©J;^^S©^ggjl©*-^ 
-©/c*. ©^^C*jCt-5C©jte^-R0 £ ^>^ , ^R© 

[000 3] 

[££3fc8fi5] ^>*-> h>«KHB (HD) «, «Sb» 

©tt^ttKS-C^S [MartinR^Cusella, NRRL 
B — .Enql .J.Med. 315:1267-1276(1986)] . CiltJ^SS 

50 fe(*©«ttS8«K:J:r>T*»«36Jn. &WiBS©J&£© 



If 

(3) 

3 

mm<Dto£?\/ \ 0. 0 0 0ffl(fiR*BAl/tt>S 
[Harper, P. S. 6 , in Huntington's disease, W.B.Saun 
ders, V j vt)\>? 4 7 ,1991] 0 H DOSSSttlSIS 

&l^tTt4©ffit££WU 5E£-r&£r©l 0^6 2 0 

*f tt*>/t o eoBfb < it w©sma»©«r & 
o fi» & ffi«*# o x 4*^wcc«3i 5 *i * 

«^#* £ . H D ©e^HffiK««itL©» 
G»C!)flEfi![«:BBJlLri»S. HDCOttfiMHtt, fla©Jl 10 

wawft»««*cr « t>a»tt- * - o 

- a >?z<D£.i t^m&mz cti&xmistix 

Ufa. co««wtttt»c!>»Hac«itf*ii6«, x« 

[ooo4] h Di&mrzm&xmte. t kd^d 

NA7-*-*«fBt4M ( y ##t©S*7J 
«WloiUt, 1 98 3f(Cftfift4r*5i 
'/SJS^tx/c [Gusella6, Nature 306:234-238(196 
3)] . C4»$A»6 % ^©ftSK^OT© 20 

aWWfflftfca-?* . H D iH£^<D*»R^tt-fb©fc 
&©l£S#P~~>^^i£ (location cloning approac 
h)£rJHc5}c0fc [Gusella, FASEB J. 3: 2036-2041(1989) ; 
Gusella, AoV.Hum>Genet. 20:125-151(1991)] . ffil©0f 

?£0*{cB k ^<(D^j£tcJ:^rccDMilK4>«c9rA:ft9 

ET-"7-^i-5:|IJSb [Pohl6, Nucleic Acids Res. 1 
6:9158-9198(1988) ; Whaley6 , Somat.Cell .Mol .Genet. 
17:83-91(1991) ; MacDonald6 , J.Clin.Inv. 84:1013-1 
016(1989)] , m&f&i&<DM&¥ [MacDonald6, Neuron 

3:183-190(1989) : Allitto6, Genomics 9:104-112(19 30 
91)] RVMr&t&M [Bates6, Nature Genet. 1:180-18 7 
(1992); Doucette-Stamm6 , Somat.Cell Mol .Genet. 1 
7:471-480(1591) ; Altherr6 .Genomics 13:1040-1046(1 
992)] fcflfcfcU YAC^a->CC*Jl»*CHD»fi»(0 
4pfny7 <*S8/J\t4) ^^P^>yi [Bates6 t 

Am. J. Hum. Genet. 46:762-775(1990) ; Youngman6 , Gen 
onics 14:350-356(1992)] , YAC (SiAI^ftft) 

[Bates6, Nature Genet. 1:180-187(1992)] &c>'^f$ 

F [Baxendale6, IHSttCCT] M?»Jg¥ (con tigs) 40 

SrLl$14{fc [Thompson^, Genomics 11:1133-1142(199 
1) : Taylor6, Nature Genet. 2:223-227(1992) ; Ambro 
se6, Hum. Mol .Genet. 1:697-703(1992)] TZt><DtfiS> 

fcHDif£^©fiffi-C*S6 0l>4 P 1 6 .3F*9©D4 
S 1 0RtfD4S9 8 mo 2. 2 M b 4>ttM«tt4SH£ 
Stl/c [MacDonald6, Neuron 3:183-190(1989) ; Bates 
6, Am. 3. Hum. Genet. 49:7-16(1991) ; Snell6, Am.J.Hu 
m.Genet. 51:375-362(1992)] c HDc4p 16 . 3 ft© 50 
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4 

DNA-?-*-±<Dm<Dm%.<D¥ffimi% [Snell6, J.M 
ed. Genet. 26:673-675(1989) ; Theil man 6 , J.Med.Gene 
t. 26: 676-681(11.989) ] OlH26C<fcO. ^fl^SSSSS** 

® C o T * <Df$^<DBm t ft o X C > 5 C t ifim ? n/c 

[MacDonald6, An. 3. Hum. Genet. 49: 72 3- 73 4 (1991)] <> 

#Tfc:<J:-?T, *J>Sc< ti> l/3©HD^fef*#5tetie* 
WCCHa 0 C £#*7js 3 ft/c [MacDonald6, Natur 
e Genet. 1:99-103(1992)] 0 C tl6©HD%&tt#*£ 
WTW@M«, *tjt(5^«©»{i6L(r^©iU 
"CD 4 S 1 8 0RtfD4S 1 8 2 lffl©5 0 0 kb -fe^ 

[0 00 5] iie^f £%± fc T 5 fc tf> . C<D5 0 
Okb flM£*-~4 f y7-4>2'U Uttla- FftBW* 

ft: [Buckler6, Proc. Natl .Acad. Sci . .U.S.A. 88:4005- 
4009(1991)] c CCDWBSCCcfcO 3 ~2©jt&?-tfBUCHI£ 
Snri^S : Tf r ^^>(a-adducin)Se j F- (ADD 
A) [Taylor6 , Nature Genet. 2 : 22 3-22 7 (1992) ] ; C 
©Hz tfjAZsh <D&{iLffim<Dl8fe<D&M r^>Xtf-£- 
( I T 1 0 C 3 ) ; aof^SB^CCteWSgratt 
G^U7 u ^>IS^U*fe^^-*^-H2jtG^ ( I T 1 
1) [ Ambrose 6 , Hum. MdI .Genet. 1:697-703(1992)] e 

U*>U Ch6©HeWHiS-rS^PB<bly-rHDffi© 
<fc 0 ft fc (DttJK, tfc fc. 
[0006] 

WJffltffcT r/0^>^>(huntingtin)j XI* r I T 
1 5 J iPf-fe, ^2 1 Okb !£fctatS3*vC 
l>tcl>fa3 4 8 kDa©*>^*»*=i- FTS&AMe 

0, W*tfiB3 7*>6^i>&< £fc7 3<H0r>3f-r* 

ffiWrrajestirc^a. cos«©S?^tttt, He 

Xffl«S&aW»5HBtt^^ hP7^ [Suthers6 , J.M 
ed. Genet. 29:761-765(1992)] tCfcW &|SJ*i(D h U * 
U*^ FS«*«ita3#4. H DJft&*±©^R««cffli 

©(Sffltcfa-r^o fitoT. *»?8j*»i©««tur fc 



(4) 

5 

NAXttRNA***^4«»itailll«»tcHr4. 3 6 

6^>^>?>*^e-rs**feccB8-r4. sect. *a 

[0 00 8 ] 3 6KB, *«H3B, ffitt©XBffi*©& 
«<Dit{5^jR<D*fi6r*or, IEm«5HCC£>Sl 1- 

343f-(D(CAG)n *W^-S«teto&^>*>*> 

fc, #jSWB, ^©XBW*©^<Djtfe?i68fO* 20 
C <D<fc 5 ft JUK&B £ L "C I > 5 .«l*03Ta©|f sa 

<D^ > * > * > mR N A 98SL % Pfi W"T 4 V&ffitt *$m& 

[0009] J/c, #*SHB. ffitt©XBW*OJS«© 

utwiTO^^^^^^aefB 11-3 43f- 

<D(CAG)n If >fX4Ml/tl^ B 36K, 

B. J*©^>*>h>»KBs*^»rU XB^>*> 

toWS^>?> h>«&f«CffaE-rSCCAG)n js«cd 
(D^>^> h>(CAG)n J5tt©»&«<!>S 1*4 C <t * 

[ooio] eimoattw 

[Lin6, Somat.Cell M>1 . Genet. 17: 481-488(1991) 
D4 S 1 0&ZfD4 S 9 8 fffl<D#4ll^ >^LTl> 
KaT4HD^?S^l^<7)gP^ [MacDonald6 , Nature Ge 50 



s 
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net. 1:99-103(1992)] BJR&98B&& UT^OTl* 

C<D«aaOT^KB, D4S180*6D4S1 
8 2 U, 5 KKJiJftf (¥&J4 0 kb/n 

X 5 F) «ttS0TC»So ^VAffiBRtf^ 
(IT15K IT1K IT10C3MADDAil 
{g^CDls^fSj (5 '3&>6 3 '^(DttU) <D^htlXl>6 

JfMrT&D4S 1 2 7SO'D4 S 9 5(D(4gt>=i^5 F 
^jjg¥^0C^Orc^ o WIRWffiBNotl (N), Mlul 
(M)RtfNruI (R)iUSltl^o 7ct£&tB{t*^ 
TSMSBW^^r^U ^MWfcfctt 
9Jfl[Blffll*^r^l/T(r»S. TNJ X^«rHteJSBB 

[0 0 1 1 ] 02 : ( I T 1 5) (g^tt 

AtDS-V^Vv hK*t*T4 I T 1 bA<D'^C?Vif 
^-J/g>(DS*S*Ul»5. fal 1 kb CD# — CD 

RNA^3o-r^<r(DiS5f4*r^a§n, 

0^Kj«RNA?SS0MSCcS*T4fcOr'&o/c o H 

D^fiS^ftBftoturiJO. *ti*ns*T^u*« 

(U->2> at/By^^Xx^K (U->3) Kfi* 
T£kCD~C<fcofco ^Xi7HDSSfe*iD4S12 
7«:*5W4CGT)n ^«acfD4 S 9 5CC*$WSVNT 
RRtfTaqI R F L P sCC J: -orffimZtlZ ^5 2 

2 J <D4 p 1 6 . 3#fflM*WL/"C^4. T> "J 
S^ttB* HD^6ft: T2 11 1 J KJlOttStl* 
S*>5£ilT£4p 1 6 .3l*iaS:a^ltt>4 [MacDo 
nald6, Nature Genet. 1:99-103(1992)] „ 

[0012103 : 1T1 5K¥ft£iWrrScDN 
A*P->CD«SC0r&£ o mm I T 1 5IS?>J©«B«{C 
cfcD, 5o©cDNA**Utl»4. ffitBB^NHIiR^ 

+-^*>^K^<Dg^Metn F>^6fKIR^^3 

n^^m^ns,, MWBJtyTox*v>©**->5 ■ 

*>6 3 '©{ftiB^L-CC^ : DL83D3-8, D 
L83D3-K DL228B6-3, DL228B6 
-5, DL22 8 B6-13, DL69F7-3, DL 
178H4-6, DL 1 1 8 F 5 -URtfDL 1 3 4 B 
9-U4„ CO»fl«3WB«T©J:5 KUTIIWly &. 

3 'ffiij2 2&S*>6ftl^>f ^.S^x^^-Met ATG$*C 
<D£9Q&. 7^0^cDNA^P->S0 r x^V>^6ifi 
«L/c 0 I Tl 6B©3 'SKiBa^I T 1 5B©5 

p^cc b 9 orc^. c n 6 B*fij©m 

c D N A CD P C R ifitsSO* P C R BMO It «fc 4 

4>CDT?&^ 0 ?g«;lH?rJ(D5 '*JB-CB, cDNA^P-> 
I T 1 6CB(CAG)n <D2 7tt[S±8K"ClR±r4. U 
*>U, I T 1 6C^(5j^L/c«^. Sffctt^JBitoiSS 



7 

6*>&Metg8&n F>mi/t3 3 7&S±S£K{$g 

[00 1 3] 14, i5, i6 : >^>=f->^> ( I T 
1 5 ) (l2?iJ#-^5RtfgB?(J#^6 ) OS^12?U"C$) 
£ 0 ^>^>^> ( I Tl 5) CDg/^DNAS^J^IB^J 

07 : (CAG)n S«(DDNA W»ft^^o 
;H, 2SO*3^TDNAi2^Jtt, iES^^5 FL 1 
91F1 ( 1 ) v cDNA I T 1 6C (2) , MHD 
3^SFGUS72-2 1 30(CttmShS(CAG) 

5' GGC GGG AGA CCG CCA TGG CG 
3 ' 

a-*;1/2BpBSKI I T 7 ^ ^ ^ - ( BB9>J*# 
2) SffifflOrfffiSU/c: 

5' A A T ACG ACT CAC TAT AG 3' 

[0014)08: ^mmkzmrm-ofrv^mz 

mc>fc^*Xi^HDPIiafflW)(CAG)n 5a©PCR 

^*Xx^HD»«t©HIB»©PCR»W 

«ttS«fffi^FOfc«)Hftr7^Urc^. AN K AN 2 

acx a n 3 «iESAcC^a*s*<D*fS[jte^-ftiBj(Dti[g 

^Ul^o AEttHDJftfeftft*<DPCR0«!©tt 
±fB<DPCRa^fiCDl:b^O ; 5x.^Wmr^,^^, -e-n 
1 9 1 F 1MGUS72-2 1 3 0fi*OPCRlt 
CAGSjlSrWOTOSo 

[0 0 1 5] 0 9 : ISlOHDWiCO^t^^ 
OT-«*fflC^fc^*Xx9HDraiaWOCC AG)n MA 

^xx 7 H D3fittora««op c r»«t»«*^ ore* 

B^ltlr^, A N 1 RO'A N 2 ttiE»ttaft&(*S 
*©Jfeffijl^B«!(Dfl««:*LrCiS. AEBHDS? 
feftfi*C[)PCRjg«Bl<DffifflS:^LrTl»S. 3X5 K L 
1 9 1 F 1KC/GUS7 2 -2 1 3 0fi*OPCRI% 
«U->2 9&C/3 OfcfH* ^nfni8M4 8(D 
CAGS1fiSrWL"CC^ 0 

[00161010 : £g&HDW@MCC^T** 



(5) 4$g3¥7 - 67 6 6 1 

8 

CD-*fc::|stf£(CAG)n SffiCD P C RfttirV$>Z>. £ 
S3tcHD#fflS*^«tb/cTy 'J;*7&<D-M<DPCR# 
*M>fS*. AN»*iE«tt»SQBg^a>a5H«:*0. A E 
ttHDStffijBe^-tOttffliTSUTC^. >K 3, 
4. 5, 7 2aaf8ttWigHD<D.a««^***<OPCR 

«w**orc>i. u->2t*HCHD*e*wc-x>-c 

>6»iE*ttffll(M3|M>PCRjBMi«d4VrS. 
10 3^ 5FL191F1 SffGU S 7 2 - 2 1 30 (U- 

^n-eni 8R»4 80CAGSffl4SLt^5. 
[00 1 7] 01 iaWSl 2 : HD£3l£i£acir<h 
*^&ni»fc<c^«Saa«:J:S2o<D«| (7 7 s y 
-) (D(CAG)n SaOPCR^t*^ flfc»14©H 
DSW«/c*iS^MS?:^1-2 o©K(DP C R# 

20 »HtLTt*Sa<r<D*fetttt4 P 1 6 . 3 

SEn*{*Wfc*&# (01 1 <0 3 , H1201) fctftS 

[ 0 0 1 8 ] 0 1 3 : >Pfl^feMO'H D*fefe#±© 
(CAG)n Jg«*{4CD^<Dj:b©T?*€> 0 1 5'0O»fiU 
30 tcm&&(D 4 2 5 H D m&W. SO* 5 4 5 CD*fM£fcfe{* 

itcrsigfSft&cc AG)n JKffimffi^aeco^r©^ 

[00191014, 015: ««&CttUH«>6£ii 
?n^HD^6(*±0(CAG)n S«*fl[©«Otti«t? 

ft*. ©SB (0 14) Stf5att (015) *»e>*n*ti 

{E*Sftac±***o6*vct>4ig 1 34*0 1 3 4W 
1 6 1 HD»a*wcwr4iJtt»***Lrrr>*. 2-0 

-5. 34, p<0. 0001 >• . 

40 [00201016,017,018: 3 OCDg^ft 
S£fi*<7)H D&feft±<D(C AG)nSa#fltO)««:a«C 

x-5HD]5£ (0 1 6 ) [Cusella, J.F.6, Nature 306: 
234-238(1983) ; Wexler, N.S. £ , Nature 326:194-197(1 
987)]. Zm (01 7) Ra'DiS (01 8) [Folstei 
n.S.E.6. Science 229:776-779(1985)] frtl^tl&X: 

CD7 5. 2 5&tf3 5 H D«6#{C«t S*tt^**m 

WDftfBtnU^XiiiUiP^lt ( t,,.,= 1 . 5 8. p 
50 <0. 12). ZlSRC/D»g«ftK:±HD»?&i»i*a 



C6) 

9 

icffljiTS#?fl£mi,/e (■en-ent 41 ., = 6. 73, 

p<0. 0 0 0 lSf>'t <s , = 2. 90, p<0. 00 
4) „ 

[002 1 1 119, 020: ffilSRtfftlfrr 
0(CAG)n J5«fi©tBIWIW&£nrf. 9 (01 9) X 
«S£ (02 0) ©H Difefe&±©MXft£ft)£Tr 

[002 21021: «fM'J>^"fScDNA« 
©HD (CAG)n JKffi©ifi|ig-C£>&. ^24-30 10 
©^*Xl5HD»5A ( 1 -53ft) CCBBT&fiH 1 

(S) RO''J>^ , ^It ( L) ^©DNAIrHDCCA 
G)n S«©PCRti»IiC«fflL//c <> §I/->K*JWS 

[0 02 3] 022 : ffitf$^<^^±K8Mitt<D* 
3<t©t@MK^ ; &^L/-Cl-^„ ^t^6^©^«> f£»j 
3*i/c2 3 4 A©H Difi^+t UT-{Col»rSllL3 

tCo 20 
[0024] »W<P»UC>affl 

[0025] #H£ijffl^cTi&BJt,-Ct,>S I T 1 5 (*. %k 
^4-7-*-D4S 1 8 0MD4S 1 8 2Rfl©5 0 

o Kb -feioo h©pssp^fi*©ae^-c*^„ ^> 

^>^>jtfeT-(*i^j2 1 Okb DNA££/c£\ ftJfeti 30 
ge*53ft-Ct>&#>-?/cifa3 4 8kDa©£>^?€£:3- 
KC/Tl^o ^ ©&¥§£#«, iESthSiatc 
**<,>r4>&< <hfc 1 7<l©^ALjge^?rWrS^CC 
AG)n M; * * F<Ktt*£W L . C ©&"«©& 

BCCi^jt^ijiefiCtl 1 *>6#j3 2©CAG3 

h>^^C*5ti-C^S^WSb/c^(©CAG F 
U 5? f U^f- ^ KSa©#SE(CBSt>^fcb h &&&©& 
&f-C*S©-C. ^>^>^>jt&7-F*i©CAGJg«© 

3 7* 64>& < i 8 6 a t-»©»jc £rif At" 40 

l&SO'. C©<t5^>f>f>ae^t*©CAGfii 
©?fc©ift3 7K«<£ 0 kA^fr&BT'Cif Att^>3=-> 

[0026] a{K^rt©C AGfi«©JWB*r» 

^6&^£ft&c©J: otitic a Gsa*^rrs so 
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£ # a t » »8it?tf ^ «t©k w » . * h s ae^ 

[00 2 7 ] I . ^>»>?->DNA©»P-->^R 

E>V n > g > » > » ^ft©gfeifj 

^>^> h >S^.*^tctet^r^>?>^>D NAR 

T-*#Bt-T i> tctbCDWm b /c#££OT©tt*tcfln*. . 
04. 0 5&J>-06tC^Ti^iJ©tt$8«. *ttXt:Httt 

ot ^M<D^ > ^>9=- > D N AM^ >^ £ 
i'iCSffl^MfAi^yfy^^DNARyA^ 

[0 02 8 ] ^>^>^>©n- F{fciB>U**e*»fe 
■T-5^)4«BFfa©7"o^- 3? -©*OfflJT©fSSI©fci>& 
K. £:©«£ 5£.>\>?->?->£ Wf^Wfca- F"C&5 
ae^E^J©i'ci-->^cJ;-p-C!f -5. C©.*^* 

- F^DNAS^j^tSfST^fcawc^i^c^ftT 

KaiswiBu iffc(i:^>^>^>ae-?xw'e©ae 
^ © c a g ms.<rm.a^n i> x i » s ©^tiae^-4 ^ 
sfrs^';^7— b'lMJsiE (pcr) x»<fc^w 

5? i> U ^ 9=- K-^KSXe i*©^^5rf Uffl-T 4Ci*it*4. 
^P-MtOfc^^V^DNA^Ife^S-ttftfcC. ^ 
> ? > ^ > ^ > > * i> W ifi A* § ft S o 

[0029] wnMWicamb-c^z rae^ig^jj 

^•5fflt§«, DNAX8RNA, J(f*0<«DNA©S 

43- F-r^DNA(cf^orfig{caite-c#. *©*§*?! 

NA. cDNA. alDNA, <BLV-Z<Dffi&t>-&Vifr*> 

A#sn*. iim^ (»«) is*. (« 
na) ^c^©^ir>'^>^>g : ■>^>^•^K*^^as■t^ 

[0 03 0 ] ^>^>^>S£^£^rr-5<t:lf;*.6ft 
m^MWicjlf^-r ?>^I3(C «fc otc©<fc^&7-{^ 



u 

««fc<. fflZ.te(a) CCDi'y^i'lODNA, 0(;ttfS 

4, S5SO'S6(C^L/n^J: 5&fc©. ^ 

^ Hz - 3 z> i> n - > *is]s-r £ 

:/ Sr/Bl^'W^i; ^*-J/3> (DNA : D 

0>) ^©i'P-xoW:/'; tfJX?2>3iM<D">? 

>^>mRNA£-f >fc-KP{C-C8fl§?l/. f#6ftfcg8i? 

* ^ > > ^ >©£U?r§14©?3q£ fco t, > r 3 h 

(c) ^>^>^>£>^R J &^3I-3"$?B£A>6g}sR 
[0 03 1] t F2t£it£^#@4. i5Syl6 0^- 

ni«Pi— (Diem 3- F(/ri>tti>«B£tt. ^ifflS 
t, > r ais & tt ifi-r -5 c i (c <t 9 >- n > * > > d 

?'?^DNA7'n-7'M7Y7'7 'J-CATCC/N 

I HBfr^^P^. 5 IB. 1991, 4-6H£#R8©c 

i] . 09;U*> W/8&D4 S 1 0 7'P-7"«^D->€ 
p T V 2 0 (ATCC 5 7 6 0 5SO'5 7 6 0 4 ) ;H 
5.52 (ATCC 6 1 1 0 7M6 1106) MF 

5. 53 (ATCC6 1108) T?&&„ 

[0 03 2] t h 4 tftfc ^ -f 5 >)-\Z^ 

g£g{C*D 6*1^*59. 7"P~:/£iJi8i-f -SfciWCATC 
C*^A^aft6-C*«5 [thMv^XDNAT'n-y 
M7-f ^'J-OATCC/N I HI?l*i!Bd', 5 
iff, 1991, 72-73M : Sr#flg©C t ] , WitfLL 0 4 N 
S 0 l&O'LLO 4NS 0 2 (ATCC 577 1 9R 
y'ATCC 5 7 7 1 8 ) #C©gW«:Wffl-e&-2> 0 

[0 0 3 3 ] ^BJJiSfc^-r^.iESa^^ *-t?i^J 

^N>^>^>DNA<Die?lJ{i*B^S{crte7KU 
Tfc9 (134. S5&£>'I36) . C©I3?IJ{Cj:«3 . >n> 

T£k:?E$K«t-e©— gB£$W-f S C t Ifi&ZHLte&m-) 
[0 03 4] ;N>^>^>©y^ADNA{*A£*«:#?£ 

t>. S6«c. ccDi^&y^ADNAw:. ^-©ag^ga 
?u© ^#$© 5 • v a ■* - 1> - mm <t -«£ (c . m/xb 

?<3&<D^ > > ^ > 3 «a?»±««8 i Jttc U T 
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[0 03 5] CW^^^^^yy/ADNABS 
tc. >'>>^>^>niRNA©5 -^iSiRMJlS5r3- 

- F-r&«e^i35tJ£ittcA*-rSC£4>-C££. 

0, 3Q«©^ 5 'RO'/X«3 '#0 

io li3 'IMnRMM«fia*S-tf. *ia<E>?rlE^RJ>'^i»?© 

[0 036] y^ADNAtt&^&T^Sm ^{C* 
fe«:4?rffi^-r-5.t \-m^Mm& ?>%m%lC}3ta<D&® 

summits -cm < u, ffiMO'^r^r^mfcf* 
n^r <Dm&#mwz*mi ©ra^tc r > » * > ^ > «e 

yjU^«,^S6SLO'DNAiB?iJ^tcJ;-,rigSb, fHWL 

^ur--fe'nisy : Bai3 iaj&s. {ife©5?i7ur-H2' 

[0037] *>^i>w. ^^(Dm^Mssmmn^^^^ 
^«tc*ne>n-c(,>-5cDNA^*wmL/. c©.*^ 

SIS ffir -5 > n > ^ > > mR N A ©c D N A 3 f - 

3- AXBcDN Alt^-^f^-tCi'P- 

^>^-T-5(C«. DNA!Ei^£jfil!5 

Affi^Jtt, iite©fc*©¥^lSX«te*^> ffiS^C 

40 r^--fe*^S. SO'jUS^: 'J#-i2tC<fc£ii*3tt<i'©jl 
SOf^KiotDNA^^^-fCffAtiCi^-c^ 

5„ c©J:^^^f^&?f ^/c»©*i=feliSlil^cig»i-c 

[0 038 ] A>? L >5 L >DNA^3- K"f SiB^'JR^ 



six. *tt#**m»4«!>**F*w>. m&wnmmt 

[0 03 9] ^>^>?>^>/Si/^ ; £:3- h'T^f £ 

e>K» J: otgs uc^i«ic^A-rsc £** 

f £ fcifXD •? - * - itfc? fcfc^ > * > > D N A £ PI D 

[0040] m^um^-ti-iim^mmicx-yx^. 

[0 04 1] ^th^^^-CfiittS. -feUt 
.X (S . cerevi si ae) ftffifflT £ ©J&*SI£ Lt^iS^ ft? 
£H,>S. •feH^->TiSS^'5X5 KICK, 2 Mif- 
Xtt^©IfjI#£^W?-.5 *>©#&&. C© 

4 . 13 5 Sy'1 6 KS^nT 5 9 W©T 3 ^g2?<J©tt 
fg> X«lsJ0K^-^©<fc5&£>/^»Xt*iigiI£> 
^K£p- h'"T £DNA©&l8l^iJ©tit*B#>6. 7= • 
StfQde novo)t£ft-r£C tot, 

^jf-r £7F$»f£&<*(c*m sot— ©£ *w&^»© 

[0 04 2] D NA^Ci'©^^«. -e©«^|£^ 
J^>^>^>7i<V^<y'^ K£p- h"-f &^>?>?->;3 

^u*^F65?iJi r^iinj^cca^j snt^st!^ 
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i^g5Be?iJ©i^T3Z.«$iJ?a3iTtcg2?iJ*^^ £tt£J:5k: 

rbse?'j£-e©r3- K{biey<j©5 ■^scciiite 3 n/c7'a 

K^O^a©*^^?^- K-r-SmRNAASts^^ 

n. $ fc-e© 2 o© d n a se?ij©3i*s©ttM* 5 
(1)7 u-a->? hggHse*fcSHHir. (2 > mi 

I2?iJ©£>^ft, 7>?--te>XRNA©f&31fjE%WiW 
thi\ X&J (3) DNA»ffi©l£f:tt***#tyfc<,>ate 

oT, 7' P ^t- £ -Sf !$#:-?-© D N A ie?iJ£$£?-C£ £ 
[0043] jt£^©^ted&S&IBfiBM^©iE«ifct4 

&o*^iR©pg^tciia-^-r-5^n'e £ n^i©5 'awE^s. 

C/5 '#flHR (#a- Ktt) ie^ij. mBTATA*'-; 
^X, + + •> f>^E?"J. C A A TiB5>J^ 0 . V 

p * - ^ - ge?"j(c^a u^tih ©Bss «*^b^© ^ p * 
•rnw. x>^>-9--ia?ijx«±i^©r^^^-dt-sB 

ox « -e nfe(±©^ifflita r ^ ^ ^ - * ? a z tc 
&(Dtn^nffi$&itm$i&&*tf t *-rz d nas** 

[0044] jw©»«T?tt. iiii»xtti«s. 

•fe tr s^T xfflliatc r *^bj©^ ^ ^ -*5$s^-c t s 
£-#c©s#rcteJ3-e& s„ ^is? (Baciiius)fii 

"C«. ^fa©DNA©ffiii^*Si^g>a:JS^*5*S 0 th 
«|ja% £' ©^*%?b^{c r £ > / "C jr H ft ffeSi-r 

~>^;uft. 35iRW)««rc, ^S©t 

(AUG) ft^WL/n^^S-^CCtSGT. IKDX^U 
MW&mZfflthJ?*-^ (AUG) 3F>4#^tS 

*>, X»#^b^ct,^©-c*-5TJ;t^ c©<fc^ncM^ 

- K-r^i2?ij©3 -MvmmRv/xmmm.mmi* 
±ib© * p - - > y^acc j; o r a^t s c <t # r * 

so -£©fe^**±pgi5fSFijsjR©/c«). x»**«*t«rtiiua^ 
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©tf y T T-MtZ'n '>gm<DtctbicUft2 ax J: 

mi&istt^w&iz* m^mmx^mt z&wzmt&z. 

(0046] *ffl®*>' , t*KXIfrh3&'«^ > ?- K©9$ 

yj«fi»ix.tf. ^>?->^>^>>'^s ; &?f^iNHfla* i e^ io 
ifc^-a-s/cAcD^^Jneyijt LTmffi?z>b<Dx<bti 
«<fci>. c©<t ^^©^^wtt^afla^© i 

c© J: 5 &Ste*>A*@©iB^JW^S©-^:/*- FB2?U 
[0 04 7] ^J12n/c^>^>^>^>^'d'»»fif* 

©<£#. m«ffltH. i'07hW7v-, 77 20 

F*«arc». tr. ^s^fcjio*ti©:fr£K: 

IS&SI&U HfflflSiSISU A* P v h V=j V 

4-, 0!J;U£DEAE--tz.»UP-X > ***-te;UP- 
ij<i; ysKi/^-s^uffi-rtfP-;*. tFD+ s >7^ 

[0 04 8] I I. RWR0^MBrog>fca&CP">^> 

»>©<gffl 

J£rF©§8ittt b h^tc^^tc^^^rri.^^^ 

fB. #CCCAG&ffi:nb-&©l#0Itf/N>^>^>it<K 

tt5fcfn©*5. ^©t^#^w*&<,>«-e;h<i: 

•3 4>.&*A£r©iS8&JB©t,>t*ftt^*tc& o/ca>^ 



?fB8¥7 - 6 7 6 6 1 

16 

SHE LXtt&gT &©#M $ L «,»ffl(***a^ 3 ti S i ^ 
x.6fi&. 

[0049] ^WOg&Rtf* £ U 

fcns.&;§. x«/n>^> h >egf>»K©#&*.&« 

Offit*i©B5iC*.5jmHH^iLT^»ib. XW^-©*?& 
. /> > ^ > * >it&?-tcteW 3 C A G5ffi3 1* -$t©^ft 

[0 050 ] #^©^>^>^>jB£^-£:3~ Kf £ 
DN A. *NF«:lE?i?t h^>?>^>M£^©IEJWTf 

i ;-->^tc<tot, c©<fc5&.«#©sm&^> 

^ > >®^©#ft©/c#©ai£Buf£»r& <fc' ©BuMm 
l£»T#spJf!£fc:& «3 . ae^rc©ct ; 5Jtcfli(**s^>^> h 

SSji^fc-X »T SS* 5 ^ 6 n -5 J: -5 tc o tc<DX$> 
cociii, ^^^^>^> h>S^l«©S?j^lS* 

w-r zmfo **©>< % > > ^ >®e^© c a gs«©^[ 

A L/tTt/^S ^ > ^ > 5=- > jt£T-*t3i jt£^© + 

*>?^m.mFf-<ochGWM.<D$)i.*mft.-?z>tctb. -e© 
a y p » f a > tf^mz&mt *> c t «^tcwffl-c* 

[o o 5 i ] mtf^^';-^>^Sfc*5t>r«. ffl 

^f4*^©J: i 5^<l^*^SH!tL > (1) TJE^j 
t >I£^©#a. !$K:c©J:5ttjgfE^ 

©1 1 - 34CAG3 f— © TiE^j ©H©^*^^ 

Wr>i-) {CJtLrilSlSSJb/cDNA^P-^fflCiSRFL 
PftWiZV, TiESj DNARO'A#DNA(C*j^-5 

*iMt«{k^- ^ - ^(Dfamcmi^xftrnt? zct&x 

>'N>^>?->mRNA«|5Ifil©7'P--/ 

£flK>r. ^>^> h>S^^Jm©feFiltt©*c>t h 

Sia ^. (, » /d 3 n J: ^ )S iE^ ^ > ? > ^ > m R N A © 
(a) U 'OIRV/Xl* (b)tf >C X i ik$5 0 . #1*Hbf 2. C 

^>^>f->©^«igccj:-7-c. (a)^tasn. 

/XB(b)Stt5C i*i-C^50t*5, /^>^>^ 
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[005 2 5(1) ^)S^>3 1 >5 1 >DNA-9-^X© 
X(*(2) M^&^>?>^>mRNA©lMX;x:wU 

^>aeT-cD#a. -?•©<£ s ftins^©^??^ 

©<£5ftjt<^©&^l</c$jgffjttf? <W*.l*. H»Sk 
[005 3 ] JfiMia?:AfT5»0^ 

ft. «x «#7K*$a. ttsm*^? >^v>^ <cv 

^SA^^-r^OfCffiffl-C*. PCRtfitiX«DNA 20 

X«#fe«r£tt u f - it ^ic «fc 0 fei^-r Sdi 

#-C#5 Eckstein, F.fl, Oligonucleotides 

and Analogue: A Practical Approach, IRS Press at 
Oxford University Press, — a — 3 — i> , 1992 ; Rt>'Kr 
icka.L.J.fS, Nonisotopic DNA Probe Techniques, T 
J^Ti -j 9 ■ 1f>^Xd'a992) ; &#MCDC 30 

i] . 

[0054] ^>?> \->fmM<DH!km.^smt-r^B. 

J&tCttKU £t$L<«. #fSHJl©:£a£^tfrtb-f 5© 

#s £ n * *?& k *j if z> & < *£if jta©5Et:*7S-r <fc 
^ftm^^a^iftSiwKiitft-r*,, ^>^>^>dn 

=f- > > * >;n' #R£-e©.m^£^MT5 it3fc9-ftB$[g 

s<i>*fi-c^Jir#.5giii^o^©j;^^eg-cti^-r 40 

5ft7?&M:#5;*.ti. Breakefield,X.A.<5>, The New Biol 
ogist 3:203-218(1991): Huang.Q.^, Experimental Ne 
urology 115:303-316(1992). WO 9 3/03743R 
OT09 0/0 9 4 4 l<D£Sc^*>iE>f#iE>ft-5 [ttte>« 
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>x«[fc©^tt^ti^t*oP>ri-rt,>& Anti 

sense Strategies, Baserga,R.6 ,$1 , Annals of the N 
av York Academy of Sciences, 660#,1992] „ 
[0055] ^>f>h >»WWi©«B*£*i-r*A 

*a- Ft-&«{^£c©<fc^ft.«>#«cei{£u jfssL 
tfrtcfitt-r^. /N>^>?->T>^-fe>^.RNAjte^ 

©B&tt. c©J:5frjte^fcJ:oT&«£rx37>3 1 

-fe > X R N A £ *<D&m* mW? £ fC;fc#& B$fW 

■c. q$ccc©j:^fe.m#©mtc^$n-&^s^^> 

* > > raR N A ©§i!R£ ffltFT? # ^fir^-C # SSS 
-5 th .ft^K C ©«fc -5 teJB«HtfW&'C S 9 Z -*B 

Breakefield,X.A.6, The New Biologis 
t 3:203-218(1991) ; Huang, Q. h. Experimental Neurol 
ogy 115:303-316(1992), W09 3/0 3 7 4 
0 9 0/0 9 44 l©^*»6»e>Ti -5 [Cti6tt?lffi 

[0 05 6 1 04. i5Mi6Km1-7^i3-F 
fbgd5IJJSi:©^tgW^^>^>^>^>>'^®?:3" F 

-T5DNAis?<j<&«if&-rn(i. ^Am®9ae^>^>^ 
©fei^tt©^ct»t \-mmicm<>tc2txz> i i - 3 4ca 

CAGJga©^. m*.i* 3 7-8 6tcS*T5^>9 : -> 

y > jie^^itc^-r ^. k w©iem-c * s m&z mi 
ia?E{cfi*L/-r^< ©»ig*5^-r5B«^g^/A>^> 

[0057] >»>>?->h Isfg&lm&t&m? &&m<D&& 
!&m<D» > ^ > h > ^MBT S 3 <E> tc 5*S"C 

», c© «t 5 &,*#©^fflja(cs^/u^ > 5^ > > # 

4 D N A - D N A "7'P -:/K:o(,>T!¥$Pc8£BJi L -cc»-5> 
AJ. ^^©DNAil^CBEJtlW^W'r^RNAfeS 

-rnti^ffl-r^cii^-c^s. i^i&e© ^ 

> 5 1 tn& > > 5^ > 3=- > ^ 9 M U ^ 1/ ©SS 
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[o o 5 8 ] ##6iu« i^xommtur, ^>^>^> 

-r&fn&ccffl-r&o »ii/i«ar«. 

KttK?lJ##6«cl3tt©7 5-/»E?iJ«:Wbr*$f3, X 

7 ®£-ecD|§$fg7 S y« (ff*L><(2. 4>tt<±fcl 
0. 15. 2 0Xtt3 0©-?-<Z)l»filT^^») 

[0 059 ] **?80!S*«ctt*y ^n-^;i/tai*av 

tzmnO¥&teJ:r>X^$&&Cti!>svSZ. Wit 
C(D^^tt»T>T*CCBF(ab')2Wr>t-> Fab' Wfjt, 

> (X«*©«lJBW»i»») Cc*t"r*Sitt. Xttffljft*. 
**6X»*ffi©*ffi«:J:^T rjgAff: (t Mb) j star 

■TSC<t36i-C#S (EP^; *y^ffiflrc&£> [Robins 
on,R.R.6 , S[S#fF^HPCT/US 8 6/0 2 2 6 
9 ; Alcira,K., B^ttMFtBIKl 8 4, 1 8 7#; Taniqu 
chi.M., KWftiWHGKl 7 1. 4 96^; Morrison.S. 
L.6 , KMISitHM 1 7 3, 49 4-^- ; Neuberqr.M.S. 30 
6, PCT/±J1MW0 8 6/0 1 5 3 3 ; Cabilly,S.6 , 
RfflftftUHl 2 5, 02 3^; Better, M. 6 .Science 
240:1041-1043(1988) ; Liu.A.Y.6, Proc. Natl .Acad. Sc 
i. , U.S. A. 84:3439-3443(1987) ; Liu.A.Y.6, l.Immuno 
1.139:3521-3526(1987); Sun,L.K.<E>, Proc .Natl .Acad. 
Sci . , U.S. A. 84: 2 14-2 18(1987) ; Nishimura,Y. P> , Cane. 
Res. 47:999-1005(1987) ; Wood, C.R. 6 , Nature 314:446 
-449(1985) ; Shaw6 , J.Natl .Cancer Inst. 80: 1553-155 
9(1988)£#M£>C<t] . . rgAffcj *^ft#£>Ot> 
X<D— JHWttEflftttMDrrison.S.L. [Science 229:1202-12 40 
07(1985)] &O*0i f V.T.6 .BioTechniques 4:214(1986) 

t»«, C D RXttC E A/g& [ Jones, P.T.6 , Nature 3 
21:552-525(1986) : Verhoeyan^ , Science 239:1534(19 
88); Beidler,C.B.6 , 3. Immunol .141:4053-4060(198 

8)] ccjc^rssK-rsci^rtfa. 

[0 06 0 ] 80©JI8«r«. #lfeWtt±fB<D*:^ ^P- 
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^^CAATftS [Campbell, "Monoclonal Antibody Tec 
hnoloqy: Laboratory Techniqus in Biochemistry and 
Molecular Bioloqy," xy-fefcfT • X >*£ijji£, 7 
A^fvU^A, If • *1f-9> K(1984) ; St.CrotKia, 
3. Immunol .Methods 35:1-21(1980)] „ 

[0061] mt*m&T*c ttmztix^&iim 
*r & km . #y k cDStRttR vaatmftK: «t o t 

[0 06 2 ] ^y^P-^;Utn:ftCcB8L/r« v Ml/c 

[Lutz6, Exp. Cell Res. 17 5: 109-12 4 (1988)] . B^fMO 

fcnfcJrCl^S^ffi [Campbell, Monoclonal Antibody Tec 
hnoloqy: Laboratory Techniques in Biochemistry and 
Molecular Bioloqy, 8iHS (1984) ] lfC£-?X<Z:<D 2 ^ X 

* #v#v-j-)i>mmmL 
wr 4 esttos ace raur^^y-^>^-rs 0 

[0 06 3] *»W©3JllOJI8«rB, ±I2CD!n*^^ai 

#CCig^PT*€> Sternberqer^ , l.Histoche 

m.Cytochem.18: 315(1970) ; Bayer 6 , Meth . Enzym . 62 : 30 
8(1979) ; Enqval6, Immunol .109:129(1972) ; Codinq, 
3. Immunol. Meth. 13:215(1976)?:#Be(DC<b ] „ ft&PM 

««rHS-rSfc«>, ^>tfhP, ^>t'^&c>*in situ 
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[0 06 4] *mi<Dt>5 1 r><Dtm-ClZ* ±ffiCDtaf* 

y>d/Ta*SW^*^CH*D^*i [Weir6, "Handb 
ook of Experimental Immunology" 4$B , v ^■JjcJl/ 

7* v9Bi1BL. *v9X7*-V. 3? 
H. 10* (1986) : laclby6, Meth .Enzym.34 7^f ^ >v 10 
^7 • N.Y.C1974)] o *»W©IIfifttaWS-Y > 

•=7 b 4-ttC&mir2>z£i>m$2>. 

[0 06 5] 3 3Bffi*JfflnItttt# 

K^rSStCffBt-CtS-Cft^^. Witt. Hurby6, "Ap 
plication of Synthetic Peptides: Anti sense Peptide 
s", In Synthetic Peptides, A User's Guide, W.H.Fre 20 
eman, NY, 289- 307M (1992), Rc>*Kaspczak6 , Biochem 
istry 28:9230-8(1989)«r#^<DCt„ tfi-^* 

-5c ST. ft— "^F^^F*, ^>*>*>^ 

r# o ^^iffii -cfiafc 

tnwi^ y^x riv*->RtF/xi*t 

U ^;l/*3>S8£S£y^>. 7^^>XBt^^ 
^>ig)^T^o *»i»*S8»"r*«filtR^^tt«WT 30 



[0 06 6] ^ftM 

^>*>F>»^©MH<fc&£«e?-*4p 16.3 

< I T 1 5 ) 5 FBfll#*€>*Q 

4 p 1 6 .3*ffla<b3^6ttSM4L/c7 5(C«KA7t 
3 y-r^T<DHD&fef*_bCC, E£f5Ht?&£#Jl 1 

-U343t:-i:Hfit»(CAG)n saanasfsft 

/Co HD5fcfe{*±«:T3 7&*>'J>U< cb4>8 6(D3f- 
T«IWi(CAG)ngaB, J£ < ft***«»I<D 
a<E^t«HilbttC^3 4 8kDa©Jft£*>><*K<D 50 



DNA-fe^> h^M-^L/TteO, *9B&4 p 1 6 .3 
[0 067 ] r =i -JbRi^iftQSftffltt, «T 

© liJfi Wl k: r 5 1 m 3 n r i > £ o 

HD-fe^>f>: Jlfe«MftRCW«»*©W92K:« 

HHzA/^-f> [Conneallye>, Genomics 5:304-308(198 
9) ; MacDonald6 , Nature Genet. 1:99-103(1992)] IS 

—Hz • • *^ f*;ucc*jc>f 1 34p4>f? 

^^ffiSflStltTC^S [AndersonRtfGusena, In Vitro 
20:856^858(1984)] . ^* Xx^> H D^f*&S, t^t© 
ffijftffltt*** Wt £ U9c ( founder) *>6HD Sfi^F- # 

aesnrc^sio. oooA©i£85Sci(iL»#t?*a 

[Gusella£>, Nature 306:234-238(1983) ; Gusella6, 
Science 225:1320-1326(1984) ; Wexler6 , Nature 326: 
134-197(1987)] . 

[0 06 8] DNA/RNA^Ovf^>y: *£i 
ISrt>e>DNA*iBKU DNA^d? HfcfBflSU IB1 
3nTl^ct^(C^-/^y #>(Xt,ft: [Gusella6, Pro 
c. Natl. Acad. Sci., U.S. A. 76: 52 39-5243 (1979) ; Gusella 

Nature 306:234-238(1983)] . RNAtS, Taylor 
6, Nature Genet. 3:223-227(1992)tefEiS3ftTl>£ 

[0 06 9] vmrnomm : 3 fj»j# 

(D»fiJOflU»S. =1*3 FL 1 9RO'B J 5 6 [AlUtt 
o6, Genomics 9:104-112(1991); Lin6, Somat.Cell M 
ol . Genet . 17 : 481-488(1991) ] 6 <D?fefe# * - * > 

^ y -^ffifflU/c : OtcMK" ^ F H 3 9 - 8 c 
1 0 [Whaley6, Som.Cell Msl .Genet. 17:83-91(199 
l)] fi*(DAlu Pgtt^*^ FOaU^^aX RCFO 
X77^^ • ^^>3^;U • 7^7 h y — *>6lMt3tl/c 
SWMb^ CI -SgO^feft 4=3X3 F^-f^y- (array 
ed flow-sorted chromosome 4 cosmid library) (NM 
87 545) . ->*-*>MS % Sa^URCJf^^*- 
KW©1f-^Uv^a>*ffl^r, ^=1*3 KDNAiO 
*9 hffifltLfcJWHt^ -f • yy 9 r [Nizetc, 

D.6 , Proc.Natl.Acad.Sci.,U.S.A.88 : 3233-32 3 7(199 
1) ; Lehrach,H.6, in Genome Analysis: Genetic and 
Physical Mappinq, lft, Davies.K.E. 5 , IS, 3— Jl/F 
- X^'V >^ ■ • 7*7 h 'J - ' ^'L/^, 199 

i, 39-81] to^-f ^y if^-tf-^3>Ki«fcorff^ 

/Co 3XSFL1C2. L69F7, L2 2 8B6RO' 
L83D3(t YAC^D->YGA2iBli;KW^^ 
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fcZtltc [Bates<E> t Nature Genet. 1:150-187(1992) : 
Baxendale6, Nucleic Acids Res. 19:6651(1991)] a 
HD3XU'GUS72-213 OM-^b'-^P- 

[0 07 0 ] cD N A©*«SC/«Ftt<b : Buckler^ , 
Proc.Natl .Acad.Sci . , U.S. A. 88: 400 5-4009 (1991) It IB $S 10 

^>RCAMffRNA*^^nfct F vA&Z A 
PI I cDNA^^^'J-^^^'J-^yyUfc, c 

Sanqe r 6 , Proc . Natl . Ac ad . Sci . , U . S . A . 74 : 5 46 3- 5467 (1 

977)CcffittStirii*J:5«:uriffi?tJ»Heiyfc. 

X; F62?Ui£^ttMcClatchey£> , Hum. MoT .Genet. 1:521 
-527(1992)tCfB$g3nTC^J:^tCbrtfo/c 0 £ 20 

•^v— X(Dt£$^B, National Center for Biotechnoloqy 
InformationtDBLAST*^ F 9- fcf;**ttJB 
l/Tffo/c [Altschul6, J. Mol. Biol. 215: 403-410(199 
0)] o 

[0 0 7 1 ] (CAG)n EfflOPCRflbe : (C AG) 

5' ATG AAG GCC TTC GAG TCC CTC AAG TCC TTC 3\ R 
If 

5' AAA CTC ACG CTC GOT GCA GCG GCT CCT CAG 3' 30 
PCRtflffltt, ¥ S A D N A 5 Ong, S^-f^-S 
/ig, lOmM Fy*-HCl pH8. 3, 5mM KCl, 
2mM MqCl, , 2 0 0 mM d NTP s > 10%DMS 
O.0.1 MfiL^—y ^(Perfectmatch) [ ^ 

h^^^->] . 2. 5^Ci 3, P-dCTP [ryjb 

^t>]Ml. 2 5*ttTaa#M5-* [^V 
• 7>a^a] «rfflt\ J5tc2gfi2 5 Ml iprtr 
o/c 0 9 4 # CtC 1. 5»J»«lLfc». EtS5**X£K* 
TcD^P^^ACcfi£-D-C1f^^;U?if/c : 4 0 x 
[ 1 '@9 4*C ; 1 '@6 0°C : 2'@7 2 , C] e §PC 40 
RJ5to4?95 Ml £^fi<7)9 5°/o^A7 5 F • P-:r ^ 
>yfe*r#*?U 9 5-Ct?2^ra, fl&SEttTS. f#6 

n/caggjS: 5 %m±# y r * y )\>r % Ky^±r»*f -r 

-So iSiU^X^ FL 1 9 1 F 1 (CAG 18 )*ffl 
C^T?f^/cCCDSJtEcr)PCRjg%«2 4 7 bp *C$>o 
/Co #3£itf^<DtMX£, DNABWtSttlC » 
UC'^L^X^ F**<DPCRjft#K RtfyJkhtc 
gftl-^Ci^l^^^ 2if5>\r • /<> Kitt 

g^HRWttitOflMtacfti^ (D*m&$:<Dm&H DfB 50 
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»©iE«tta**-rtr>y (saso) -rsctrtT-ofc. 

[0 07 2 ] USS^J 5-8 (cfa W £ H DRc>'IE^&fe<* 

TC>4«^(DR8W«*Wr*3te»«C«l#tt{t3tl/cHD 
7r $ V-CD— 5ft>£ff/c [MacDonald.M.E.6 , Nature 
Genet. 1:99-103(1992) ; Conneally, P.M. 6 , Genomics 
5:304-308(1989)] o O/c 1 5 0 V y 5 'J —<D *) % 

B7 5(DHD&fef*£S!55fJL/c [GusellaJ.F. ,6 , Nat 
ure 306:234-238(1983) ; Wexler.N.S. 6 , Nature 326:1 
94-197(1987)] . Z^rS'J -S^D 77 ^ 'J — iL/T 

^11 2 5RO'3 5HD^SM4i/c [Fol stein, S.E. 
€>, Science 229:776-779(1985)] . IE?S&ft6f*tt* 
H D 7 t S U -C0^^(married-ins)Sc>'^H D 7 y 

^TQ«**COOTt^U/cDNAB, y >rC^#-te;U 
^-OXttSrttJfll^^BSSL/ft: [Gusella, J.F.6 , Natu 
re 306:234-238(1983) ; Anderson&tfGusel 1 a In Vitro 
20:856-858(1984)] . ^©^-Xrtt, DNAttS 
tt/MH*6««Ufc. U>/m »r#JllLXttI8©DN 
AMtcfattS P C RMM>Wft(cttfSiaBab6 tift«> 

OTfe, **Wt6B«iSB»*6^l/v MbflFF-*** 
aT4C£rDNA*PSLfc. T^TOD N AGDH D 

ae^- ( c a g ) n smog s * # <m ? - «msihm 

[0 07 3 ] HJStHI 5-8 teettT*«¥flWMff : R 

#rCCctor?fo/c 0 9 5%ft$f^>F£> SAS [SA 
S^>Xf^a-f Inc., SAS/STAT tf 
—#-f F, /<-5/3>6, 4ft. 2t(SAS-/>X^ 
F I nc. "J— , N . C . , 846M , 1989) ] CD — fl9#§ 

^^dftff* WB-T S@»»ffifi 4 mffl ^ /Co 
[0 07 4 ] gjsgy 

Fjg_y 7'^ a->{bx^ v>^A^~r^/c^<Dx^ y > 

J:r>T»S3ti*HD«»i«««H 1 tcSR^tiScfc^tc 
D4 S 1 0RC/D4 S 9 %m<D2. 2MbCC^/c^or 
OS. D4 S 1 8 0M'D4 S 1 8 2fffl<Z>5 0 0 kb -tr 
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r smfflffliftw-r 40»stee»ott 1 / 3 # d 4 s 1 

2 7 R&D 4 S 9 5 fcfcW £^S*f:£S£^ J: o 
"C^&^lt&^ttt: [MacDonald6, Nature Genet. 1:9 
9-103(1992)] o D4S1 80^^D4S95MD4 
S 1 8 2ffl<Di&mm<D$)4 8 0 kb ^J/c^6 4Ii 

5 FSr^gW&tt^fcO. YAC [Baxendalee,, N 
ucleic Acids Res. .19:6651(1991)] #>6CQtf$RR£>\ 

^fe(*4^eq^^ ^ ^ y -R&s hiatc<o$mz 

?:^T^cn6CD3X^ F<D1 6fl^SHCSt 0 & 
*BWfL x*V>*t«S*ffifflU D4S127£BMI 
n&««OADDA, ot-Tf (adducing I 

with, «fattG^p^-/>*v^y>y 
{fcu-fe^*-*^-— ^ae^tBijeo*: (an. 

[0 07 5 ] *»W#6ttCCtCX*y>Wl^iffi*, 
D4 S 1 2 7CCB?Sr4SI^JeP<0^S*<D=i^ 5 Ffc 

#f#e>n, *n«*-x*v>, xb-uccx:^-** 

>^U aXS KL1 3 4B9fr6L18 1 Bl Oit 
9 O0xf V > ^ P - >4S/c„ x + V>y*P 
2 o<D#fi»cD N A^SflJfc^SES ft 
/Co ^SIi^KAfflIi»cDNA^>f^y-?r^+ 
V>i7P->DL 1 1 8F5-U*ffiWX^'J^> 
IT15A^A*3tl/c. IT16A^ 3o<D 
i*V>^P-X DL83D3-8, DL83D3- 
1 R&D L228B6-3 <D?- )V%m^>XtfZAmW& 
KcDNA^^y~***y~->*T5t ifcj: 

O*gtsn/c 0 y--tf>^P^ r^WciO, 
6iiIT15ARO'IT16A«, HD**SOtt 
10-1 1 kb 1RZfi$<DMmOffift , V8>2>tt*&ftl> 

($(D2o<Di*iLiy/c*' ; eg^i*r*)^ y 

TfeOT^^c HCfll 0-1 1 kb @*<D 
thlH (Jfr. MM. »I8* Km. S^fiRAKS*©* 
fi*<DRNA^4>t^ai^n/c 0 

[0 07 6 ] I Tl 5 AStf I T 1 6A^ftfflb, 
fi©IE^f*£A¥T£fctf>* ^<©tHScDNA7 
^y^y-tcr^*-*>^3^/c 0 03 B, IT15 

t s5oocdna^p->4, jim-cm, 
& 0 i t i 5©«jaaawtt^^^ Ka- Fttieyu*s 

WOT4d0, H4, 0 5SI^H6tC7SLri>4J:5«: 1 
8A<Ol^-f5 1 0, 3 6 6gi^:i/c^^o 
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9,4 3 2^<0*-^>IBR»»3 1 6 &S<D *Jf&t4 

#*lt:a F>BC<D»{ft<D2 4 OifiSiifeK&gOT^ 
£ e IT150*>^ia«tt. 3. 1 447^8? 
*£WT6 3 4 8kDa*>/**«<!:«S3ft£. fiO* 
- rf > MWtftft <omj]<DU ets F > Bnjfi&tt & & -f - ^ 

OCBfc^i 3 'WJcDMet^ F>*>6*£* ^Tfc6*T\ X 

io o /h § & z zti&t^*> ^mt§m u £ 

ft&*Pr>fco 

[0077] mtmi 2 

(CAG)n h y ZZl<*?' Fl^gO^M^ 
*©5'*i*«:iac^ I T 1 5EMB20l'i r ^>S:=i*^ 
FLW4CAG h y y'U > h<D2 1 at'-^Wlt 
(05), CCDK?fJ4, 4 p 1 6 .3CCfcW4#*S 

bb?'Jr« (ssr) ©Ht^ci^enn^sy^AE 

WiJttR-TSi. jESfr^*^ KL191F 1B(CA 
G)n S«^»t*5Ci^f 1 8 =Jtr-©h y:/ 
20 h^btl^Ct^!iU/c (05) o COJKH 

ltd COSSR>»t »1 1*6»3 4<DK«# 
ffit*4'>&< ifc 1 7 OcDgiHiO^fiite^ («1 ) 
^L/Co 1 7 3 ©iE#§fe&tt© 9 8 K# 1 1 -24S 
a©JS«fi3ftWl/"C^/c. 2 0(D^6*»2 5 -3 0 
0Sfl«Ht?«ia3n, 2o©jE«»fe#tt*ti*ti3 

J8£#(D:£<*B 8 0 %T*£> ofc. 3 0(D ^ P - >(DS5?U 
30 ft5£#SftC<fcft«, C(D^[»(CAG)n (CS^ 

[0 07 8] ggjfcW3 

iBftHDtBaWrift&ft**©^^? FGUS 
7 2 -2 1 30 <OB9U»3E»W«:cfe 0 , SkA^frJES 
^fijte^O 4>®^tcA#ft h'J^^U*^ FJS«<D 

4 8^f^W?SUfc (07) o PCRt^S^rHD^fe 

ttH D^fe&±<7)(C AG)n RS^iEffttSfis&tfifflW 

[0 07 9 1 B8B, PCR^gA^^^^HD 
jW*<3WH1RO/c*W«6**© y >^***D N A±t1f^> 

50 ->2) ^7A© : f«(^>3 1 5 V 6, 7. 8, 
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lORtfl 1) 6CS^$ftfcC<b£^LTl>fc 0 t©£ 

E*c?MEr4 3o©jEa»!fe»tt, y>7 s ;ux(D?SIiJCD 
l««riBe-r A XK* SPC RSft^iLte 

(ANL AN 2 > AN3) 0 Si:8(OHD^fi«t # 
^^Xx^HD^i^^ F©4 8Kffljg&<fc0fc£ r F>M 
at£«C03fc r^r^- (IsfW) J S<DPCR1^?:4 

©IE^ffit{^©*£^TU-> 1 l<DDNA4i 

-ttPCRj&gJ (AE) *^it % C©C<hBHD&fe{* 10 
(CAG)n Jgffi3W^3er*4C4*7Sl/riiS. 
>6B, $ifi©DNA©tflW:5 tST/hifc XttH 
•©HDW«W«c*fllf*dWUrc»&. U->3. 7, 

iosc;8i«nfn, ffi*cctMx*s*#<&-3fcH 

DftKWPCRj^*£WLTlr>/c. 1/->1 KCttH 

D4$sw&pcRjSW#aottc^, -e-ftBc©^« 

©DNA*** RjiafT»*ffJCcWISna(r*(CAG)n 
a? hjMfffcJcoTMBSft. C CtttMH DSttHB 

e^*KS»«:»tHsn, looi-costtatr-*^ 20 
titMShfc. #cc. c©*««^«c«Wft*p^ 

©2:TCHD*£*FJt8»ffiLft:. SSH©HD©|Bffitt» 
{C4orKfjH©4 0tr*9 l C©3lffl©5&<£©J5RA 

<bHD^6<*±0(CAG)n SfflCOfiS i<DPrf|«:A*A» 

[0O 8 0 3 H9«, WjK#DNA^-#-8^.*c»-3 

^XX7 S$tfi3fe©|g 2 ©Isjflag*© P C R # #t£^ L X 
OS. »o*i©^«ttRCc«ftSnri>4J:^«CHD* 
-tS^tt (U->6 + 7, 10+11, 13+14, 1 
74-18, 23 + 24) X$>*> [Wexler6 , Nature 32 
6:194-197(1987) ] „ &g|© DNAtt> >fVl/&# J©ffi 
*£-C{3i£ft£iE3?$5B (AN 1 , AN2)©l-?©>tf 
SaefSrMLt^/c. W»RU^«© D N A ft*© 40 

Hummmtemw (ae) «r^r, xEWftwsiise?- 

©jg^J: 0 fences < , jkW xccflfcKttSOtt 

CtfcSjlOgSiWLn^i^'CftS. ^<©f@ 

c Rm©?<f xo3Esaffij*3&s»tt»»<oeak:s* so 
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T£fe©r*St <t£^LTl>£ 0 ^tcSSf^ttJ, 
U-> 1 3RC/1 4r» 6ft/dSSr**. C©HD* 
^rfg-^ft© D N A BM$H© H D P C RS^J <fc 0 

[0 08 1 ] 3S£$-C<D<hC3, S^H MacDona1d6, 
Nature Genet. 1:99-103 (1992 )CCT"S"^TJJIJfflCC, 0 

V A X©$5B <fc9feA£&PCRjei???:HD Sfefei*** h 

-*66 6 n tr-j: o fcrt**iMX©«HT*5;w. 

o/co cft6©!£Wr«* iMf->:/av H^WCCJ: 

*rr a e cor i wjt©« s ***** lt i> * c <t wwm l 

fc. HI 0B, £Sttj:HDWmtfi9mStir\,*Zy4 
KA*ffl5fetT4T^ l ) £? - 7 r 5. 'J -©—*«: 
tftHi3n/c^tt*«Lri»4. U->2©DNA£ttip§ 
UfcHD 6* ^X^KGUS7 2-2 1 

3 0£*P->L/c e S&S4 p 1 6 .3^tBSSCCJ:o 
r Pt###» lW5^Xi7HD *«5CC r SSSf 5 ft 

rc^&j:9(c (HBRuia7) % co^r 3 y-©HD 

»»WttPCR«»ttffla«C"!f-/^E»*^L/rc»4. 

[0 08 2] mam 4 
h d k»t sjrjjfeg^M 

(CAG)n Stl©^3^/cJtcHD^^»©a3»«:J: 

©CC'^#ttSttf»***»A:. 0 1 1 StfB 1 2 CCB, 
Cft£>©:7y 3 'J-(D(CAG)n Kffl©PCR#«?©tt 

**sot^4. aijffl©#«ffl*«je-r*4 pie.3 

©»*&jR©R F L PRO'S S R-7-^-(DSS(licS^ 
H D SfefettKMEP* ft W 4 • 

[0083] 7 y 3 1/ - # 1 "CB V HDB. mFMftfi 
f*3 *<hftCCIII- 1 - 3 tCS^CC^ 

ft/c. C©|5JD?fefef*B. iCttrnCLLTl^C^flf^-C 
*4ll- 2CC#?9EU/c 0 PCR^StCCJ:^. n-2S* 
©^fei*3 *ttiElft«H©a«)t:KC>5ia»-CPCRaW 
4i*t4Ci^ffflO/c (*ij3 6C AGaK-) . L 
II- 3&CXIII- 1 CC4JW 4150(0 A G)n J5?S 
tt*ft*ft»4 4R^4 6 a e-SriWbTlfcfclr 

/Co H«©JMbW7y 3 y-#2r»6ft, cent 
Ji^©H Dge»aa<*lll- 2 3WII- i &o*in- 1 ©IB 

(^4 9tC^L/, fi33CAG3t-). 7 7 5y-# 



(16) 

29 

[0 08 4] m m 

<Dfflv&&£\<>5ffim±y?'m&ti^* hj**u:*-*f io 

[0 08 5] RXffijfflfr fnwvna: £©»«©fB 
WkVX<\2 7 .3r<D«»»ffi©ieBtt h FMRiae 

<Dtfc*teBBjiOTl>£ [Fu6, Cell 67:1047-1058(199 

1) ; Kremer^ , Science 252:1711-1714(1991) ; Verkerk 20 

6, Cell 65:904-914(1991)] „ ®J8l§gffi*{¥5 ESWJIls 

^•FJS«: (CTG)n *U V r ^>(mysotonin):?'P r 

[Aslanidis6 , Nature 355:548-551(1992) ; Brook^ , 
Cell 68:799-808(1992); Buxton h , Nature355: 547-548 
(1992) : Fu6 , Science 255:1256-12 59(1992) ; Harley 
h , Lancet339:1125-1128(1992) ; Mahadevan^ , Scienc 
e 255:1253-1255(1992)] „ H D (C A 30 

G)n JR«B I T 1 5 itfe^^n- hMtSS5»JWCC*4C 

^Jrt<D(C AG)n SfRO«^JIKHr*»JI»»e©X 

(D#SWftiIl>~C^& [LaSpada6, Nature 352:77-79 

(1991)] o IfiXffi^aP&VttSRSBtt^x h a ^ -r -©H 

^#B$^t&?g3ft&^#&£ [Fu*E>, Science 2 55: 
1256-1259(1592) ; Yu6 , Am. 3 .Hum. Genet . 50:968-980(1 40 
992) ; Gruner6 , N.Engl .3 .Med. : 476-480(1993) ] . H 

fil&<tfc3b^^gc^o IfeXffi^Sf. 
35§3Stt^* hP^ -f-. RtfHD£J£»-r4£, WHS 
Mtt®5SKSSffi©^iL*f3ijSe? <D*£5£14W:<fc 0 MRS 
nriJO, Sffi©fi30*JWttl£^ttl8»6n«:c* [Bi 

ancalana^ , Hum. Mol .Genet. 1:255-2 58(1992)] 0 

[0 08 6] 85553Rtt^x ha? -c -<£>jK«<Dg3<Dft/; 50 
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§^2l#^S(primordial predisposing mutation)©?? 
1±ifjkV£.LX^2> [Harley6, Am. 3 .Hum. Genet. 49:68-7 
5; Harley£>, Nature 355:545-546(1992) ; Ashizawa, L 
ancet 338:642-643(1991) ; &Z>'Epstein (1991)] « fta 

Xffitiffi-Clt, h y * * u** F£tg<D8fc*^*l/Bt> 

%u&^(om^m<Dmj£2fttcmifti±i>nz> [Rich 

ards^J, Nature Genet. 1:257-260(1992) ; 0udet6 , A 
m. 3. Hum. Genet. 52:297-304(1993)] . I T 1 5 £|SJS 

flc<D4>tt< <hfe l/3BlfljfeCCRIiirafc<Dr*& [Ma 
cDonald6 , Nature Genet. 1:99-103(1992)] . Ctlh<D 
H D «3f *T 4fi*0^«SSE«0*# [ Harpe 
r, J. Med. Genet. 89:365-376(1992)] <h*l<&^bi*£ 

<b t h y * * u * * F J5^©i£^:^S ! J©^fe*-t^^ 
o(D-7t ^ »;-©»«fB, HDiSt?(OJgtt<DJfc*:*SW 

«*5HO±E«:*5(CAG)n JS«©S5 

cn62o©ffiWc*JlirHD»a(*RO'HD<0 

p 1 6 .3WBSt«i r y 3?^U*^FSa©ffc*:tHii 
[0 08 7 ] H Dftfei*±<DiliI¥ffiM^*IS2t- £/c 

Mbb%ffimm>^ft&Wb&tete^tc [MacDonald6, Na 
ture Genet 1:99-103(1992)] « Umfthit* mfc+O) 

^L/Tl^j:C>3ocD^iB0()/^6C^i"CS)o/c [MacDon 
ald^> , Neuron 3 : 183-190 (1989b) I Pri chard h f Am.J.H 
urn. Genet. 50:12 18-1230(1992)] . cn^CD^IB^, * 

ti, 4 p 1 6 .3^ftOv-^7-Cc-ol>riES%*ttai 

wfiiie^»^F3i*--tt 4 p 1 6 .3jfieT-M*wra# 

t,c s 4 p 1 6 .3rtOf60^-*-<D^SfK:*-^i*riE 

«i— &ffl#WHBS. HD4*LtC^l^ fil 6*^88 
<DI^S£WUri>£<h#;L6ft&o C4l6(D 
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[ o o 8 8 ) h dk:~h>t© rmm£.i £wrs<@f*«: 

# & C t im&<DJj& J: 0 4> W&14£tl{ft-r & & 
©■C*f). o« 1} tttS«affi^«r«tf 5t&g#fc<. jfi 
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20 



-f -^Sl^iaC $HZ> [Harly6. Lancet 339:1125-112 
8(1992) ; Tsilfich's^ , Nature Genet. 1:192-195(199 
2)] „ ffcE^HD r *) Z 9 U * * V fcGKD-te i/ J > 

ttGjtAMaft^&Sc: <h#jB6ttT<,>£ [Merrit6.Exc 
erpta Medlca, TA^f-;U^A, 64 5-6 50M (1969)] „ 
[0 08 9] BSXffi^Sf<D^KWFMR 1 itf^T-OiS 
ffiW^rg{t;*lM3tU'Cl>-2> [Pierrettifc , Cell 66:8 
17-822(1991); DeBoulle^ , Nature Genet. 3:31-51(19 

93)] . wfSJiBiffiS5SK§gffi©^ttjie^-f->«. c 

«5155BttJ?^ ha?-( -ft*. 5il©l£*#igi£& 

&A«B££*< «mt2«D6nrc»«ci». H D© r 'J * 

* Utf?- K5«©!fc*©f££^tt^#~XMCtt£< © 
^Ifigt4*i??«Ef £. C©IIf&CC£6*>ft£oki)«:&< 

s*ftc. 4pie. ascMcr^ffite^e^tt©'?* 

)W -^N-^**;U>.m^{*H D©^©€:313-r> I T 
15 mRNA«HD**Sl^C#&-r&t I- 

>j * * u*^ Ksa©ffi;^:«^n*$wrsjt^©m 

^^{C®ttt?* S <t t>5 <fc •) , C ©?£.£ 

©*f 3ijae^©mR N A*»Xtt £ *fM>l©<,>-f 

^a^(c^JI3ti2.^t^^^©»#*5H DOJSBt 
*£i7ni**a/c [Wexler<E>, Nature 326:194-197(198 * 

H DRVJEgJxM-* 4 *©Jtf£ 



* 7) ; Myers6, Am. 3 .Hum. Genet. 45:615-618(1989)] „ fi£ 

U ^©^{JillWi^S. ±imlCCMet3K> 
n Sffl©3 •ffl[J-Cj|acO > >^*jtjg!f$5(c* 

y - sou 2 s l ajtim&$> &„ c com 

J5'fg« 5 ' JMHRflMftfCft 0 . mR N A V-OVC®. 

tiz. s«©S£^*i??ii-rn 

10 «. ^ft£^rt£mRNA©tg]gp. {ig&e*. 

??fitc cfc <0 . H Dl£^i*©^e<J^SjS?Baf&g|J{4A^St) 
•TSCiiCt^r^^^-^W^ifBlb. J57J* I T 1 5 it 
GT©K^gg^eiI{a^a!tt-r CifCt-^rmRNA© 
^$l]^*5®®2tl-5tC^ t©-C*i. ^(C!fe?> 
C©J5«« I T 1 512^1^ fC(4gl,r(, ^755. ^g- 

n*«g®fe?-©l6^K:Jpt-r Suffice J:otHD*i< 

[oo90] c©^^©^^fc*^t3e>-r. ^-nw± 

IB©*Smi^3!>sa< . ^>^>^>t^L//c 1 T 1 

SmRNAXW^>^'^»U'<;l'T-©fPflatcS-^^r h 
U3?^u*^ KJSffl©ffi;5g*5HD©/^H<baSC<!:«^; 
4> 6 L^. h'J^l/ b ELfflDmWDWRl-CIti: I T 1 5 
DNAIE?!J«> GenBank-^-^-^-^CCgitfCii^S*! 

N*ffi^ifi©^u* 5 >sc/7'a y >©{*s*^^. -e 

©r 5 sMrninmta<D$ »*?m£m®L&*ni2-r. 
» > * > * >mmc -ok »r ©@ rc o /c*n t {5j 6 3 

nri>at»„ N^tffi©* y-y;i/ #s y- 
^"o y cn6©r 5 ^^©st^*ggp^$w 

■r^^t5t©*>>'^M<i:©^t4^L>TI,^. C©<t 
h<D&< ©fe©*5DNA*l^^>>'<^WC$>0. 

4 4 3j^sccc^— ©p -r^> • %> v*- 

IS©1$StR^©^ ^ » - o >{c s n r (, »5 . 

[0 09 1 ] 

[SI ] 
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40 



JES^fe(*©i5tSc>'!^ H D 8Sfett©8&tf$g 



^4 8 
4 2-47 
3 0-41 
2 5-30 



0 
0 
2 
2 



0. 
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0 
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(18) #HI¥7 -87 8 B1 

33 34 

g2 4 169 0.98 0 0 

£ if 17_3 1.0 0 ZJ 1. 0 

[0 09 2] mwi 5 MHTcSfti*. zht><om£ts:%!&#±.K-^tcZti 

jEftRCEH D!te&tt±.K.1stt & is*? Ki5«© ■5>JK«©fi£ (34- 38 *<fc ) IJ»MRCf H D^MSlffl 

gg©^ <Dmffl.*-?te<' bomb*) > {St^HDffiHfc^tbriBt^ 

'J - fi*© 4 2 5 H D 8&#<D&&~cm& h , 5 4 5© £&oTl>£. 

jE»»attoH Df-y^'u^ sa©a tr-«t Jt«u t o o 9 6 ] mm* 

n®TKi* 3 ga*jar»*>hTi>fc. iE*»&ftt* 1 1 5 0 ©h d ? r 5 y -s*©&a©g 

1#>63 4©JKa#{i©PCRj^5£fi-r&2 4<I©*f g££<!:«Wck©-C£>£„ SjlO^J^iimgtVtU 

9.71. «W«£3. 21). HDS=fetttt3 7*^ *£^M*W^-5>;fctf>. SiHI©4 P 1 6 .3 ¥fl£«*r 

8 6^(4©JS:ft3l«Ctl^T'5 5 4<S©^-fib/cPCRe -rS3o(Dg;AHDjta.^ [Cusella.J.F.6 . Nature 30 

fj^Jg.*.. -?'©4 J *fiiW4 5#&-C*t/c (¥*£j4 6. 6:234-238(1983) : Wexler.N.S. €> , Nature 326:194-197 

42. &8WBS6. 6 8). (1987); Folstein.S.E. 6, Science 229:776-779(198 

[0 0 9 3] HD&e#©5% 1 3 4M1 6 l«-eft 5)] t**-5T-en^n7 5. 2 5Rt>'3 5<I{££§9S« 

-eneffi3c«5ii«s*r**.2,ci*^p.nTc»-5 0 est (*Ytr>40 ufc <£>©£#:$ Lfc„ s^Mrt©^H 

T&t&©f4#J5«s©#*KK^&j$>§;o>£i?^s 20 D&fe{*##— <Dmwc$>2>icbfrfrt>hT. mvmrn 

tctb. cn6©2-ocD^feft©ffi* ; £8iJ^«:H 1 ©-Mm£mJK«©gg£^u-ct,>s [H 1 6 . m 1 

miBicVav hUt^i. «igft*©*fe*« 3 7 2» 7, B18]. HD Sfcfef*©S:«©^££14 JiS*'* * 

67 3ig{a©5:a©§3£7KU> f©**IB4 4t* Xx?H D]fil#t©— M{Ctel>r*fc®14-C#>£ (0 1 

ofc (¥*344. 9 3. WWB25. 14 6). [*3I-r*HD©ffi$fc« 1 0tttt©W£SO. 

©*femiJSffi*(4 i &3 7^6 8 6 3 f-WLrfc*). iKLT13, 00 OlSKRA] . ^Xi^SftO 

-?-©**:fil«4 8^-C*-3fc <¥i$j4 9 . 14. mm C<D?>7V ^tCfcW&JKagg©^ (**«14 

«^8. 27) . 3(^S*©HD^fe{*©I:t$5 6, ¥fcJ4 8. 26. ©««S9. 3) tt. t*<XO-7 

WK«r>Jt*tt5 5ig{i<fc9fc:*£&i§:a©Sg ; £Wl/-C r S 'J-S*©HD^fe(*©fco<!:A#^-9->^3!pe» 

fco (i 6% n 2%) . c:©c:<?:(i^3S<b«i©ea #6ntefc<ot«*{ca*c-o-ri>«ct». mi 7&t*0i 

C^^j^^-Cv^Bjfifett^iSL/Tl^o 30 8tt. HD^^Xi7lMiKfc»5J:') k^afifctto 

[0 094]|jMlfcf-$©t«J, HDS*fe&#jI ■C^A3tltt<^2'3©^7T 5 •;-©*S* ; Sr^L/T 

£gnri>&^££#DNAv~#-ji$j(©w^cj;o cnf>©77 5 y-o^ttssEft^©^*^ 

rgt{C^gnrt>S2. 3©H«W(C^»rL/c<l*^6 U HD©^St©^W^SJ,(,-cc^<tfR#3n-C^ 

f#fc£fcfe& ; £rOT&l/"Cl,>£ [MacDona1d,M.E.f> , Neuron £ [ Folstein.S.E. h , Science 229:776-779(1985)] . 

3:183-190(1989) : Pritchard.C. h, Am. 3. Hum. Genet. 5 C©^{t<=fc , J. S^SS ©A* ft^S&a*^ 6#HC& 

0:1218-1230(1992)]. Cft6©fflf*te?C#K:lE?5riSH 0. «£n-e*lttidfefi#4 0 &C>*4 9 Sffl^fiLf *ofc. 

rtos^ss^wo-ct^o ^h<Dm&<o^mmm m 1 7©-? 7 5 v-co-mic^M ofi-mi*? 

SftTHftHtf. ®6«HD©^ig30m£*I-$"C<»:#s ^T©HD&fe&©J5agg©#^£^;tiC*ffi£b-C*J 

-C££„ M-^t-.S^*^XAi«MBa^c. ^Wr*S*A J 0 (p<0. 0 00 1). *Sl»tM©¥Jtt*4 2. 0 

S«©«3©^tC»-5<J©^*gl5«J©HD-7T 5 'J-rt 40 4S«m(i-C*ofc («««M2. 8 2). 01 8©H 

©c©<t -5 rm&vmmm*^ D«fitta*fflK»D^*£f 

(,, o iWSKtBittT^fcrt* (p<0. 0 04) . iS^ti^ 

[0 09 5] jltif-fOttAi, HD©rg^jjE ©f ^4 9 . 8 0 («f|gi5. 8 6)?Aofc. 

wxwt r«ffctt3e»*»j ecB8jt-rs*BfflWKiE««c [0097] mmm i 

KBI,aShTiJ6T, *^PIf©^iS^fc«C^C . 013©6 2HD^fef*m h y 5< ^ u*^ Ksa© 

ti6©ll»S*©^feft«HDifflJg'r-5C<!:«-C#^ gg*>. +S^-T?.iS©H D&fe#:±-C«-r&C i*i-C 

Ctif.oMJ rg^Siftj HDiMi *fc. 2 5©fiea©^t©2 0, SO'3 7©5c^G 

WK|lDftfe^r**-2>©-C. -e©^femiHJFS«:iES<!: 50 li^^c^Stt^^^TC^. 5 0 0«±©M^»^ 
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[0 0981 11 9Ryi20«, S.ilb/clIi^tfS-r 
D&feflLh©£ffifi3Rg©tBII814£^0-Cl,> 

5?a©g3#&^LTi>&2 o©s$s»{^W5fS:fe 
i 3©^t#:R3©iiA-e&9. 7^>#j&t>-c 

* o /c. t§ ardw t *> (c 5 -oommmm.^ o *>4>& 

C^bW^L-CfcO. ®m<Dlx&<D&Z <h-£©#g8© 10 
J H£©-£ft<t©1fM1±«r = 0. 95 (p<0. 000 
l)-C*^/c 0 2 5©S81<aiK:*it-t£K«;g3©¥t$l 

[0099] 5<:as*©^fe#-c« > s«gs*i^tL 

T<,»£3 l©(Si©rtgl?«:2 6©:g318A<fc5©fi3$; 
4>T& o fc. C ©-? A X©ig^te©S8&*©?£ieffc-Cg| 

f?3ft£J:«3 fcigTL*'. I|I?3 1 -3S^#{4©$5H-C 
U*>/Jn$ < tti^ ^©ifAfctgiltoK: At 

t,>„ se-^r. ^©sffls? t»©^-«©^ni©+Biia 

14ttr = 0. 35 <p<0. 04) brt>&#>o/c. 3 7 20 

^fc. sdKiSi-?agi3ftfc&A©g3igAi3:4 is« 

cftttS£8.s«©2teK:ifiA>o/c <> atttR 
o'filttG^ittc, 5<:lJSa©^-i'Xi^b©?fe3S«y t 

[ 0 1 0 0 ] H D&fet*©4ttt£Si-Ct&g?Sft£J5«© 

3fc«>. a*«5 HDse^-** ';r-*>e>f#/ci|f-7-D 

N A RVttfc? i> <J > ^3?St© D N A &*<DJgflte*iMB 
Lfc. 02 1 {CjjVfil&Stt. HD&fett©SffiK:o<,>T 30 

« istvmmxfn* d n Aiiaccwt&ffljg!&j& 

IESSSfe^©5a{C«#j5ELttl>Ci*^L'r«,»S. fit 
^ Kt-©t^r«<^Xx7 H D7 ? 3 •;-©— 
*<5. 2 4#>6 3 1 0^©t h-C$>-otc, m# lSO*2 

n-en4 5a&f5 2-cwLr«.»4Hia-c**. fif*3&: 

O'4«HD5fflfi3* i -en-! , n4 6S0 t 4 9t*5 l C 

ft^ti^js-c**. <n*5««ij©2*f©i,>r<i*>-cttc> 

55iJ©|Hjfla£3fe-C&5rt*. 5 2 =i f-©HD5a^Lt 
t>€>. 5ffiW«f^r. fg^DNA&tf'JZ^-ig^DN 40 
A©PCRlfipa(cj:i3 > lE^fefett&JfMJgigJtlslCfc 
©£^x./c„ L*>U '^Af^DN A iit^-T ■£»<!:. 
fit-? D N A **© H D ae^HiJfitfcfe^J©^^;?. 
fc. 5S0i|©-5*3>3o (2. 4*J<fcO-5) fc#fcS2?V 

-C&A/CUS. JPHftgg Incite. ®4>A#&^lfc£7n-r 
fBft3t;}:&«>gl>J5fg£W0Tfci9. ^ftSi^tt-C*^ 
fc. S'J©2 A© K^" — U-€-*iJ: *) t)iUg?©H D$5BB 50 
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©5ffl£«U c©B$*T?«fe^^$$-c*i. 

[0101] 5£«*>6{^3ft;fcHD&fett<iSlSa>6 

tst $ ft /c * ft t © w ©jgfsg s Am t » © a -es&mc tB 

So Cft£fiJgft?tfTT HD&feft©Kffl©63tt 
ift8 5%©£jt*s^£bl,-CU;te. J££©^g& «&<!©& 

XOWztt* *f : fDNA*^©Sffl©ifA{C»-^<> it 
tt©IBfS^i©^©H D hU^^Utf?- K&«©#55« 

-3 TS 1 C 3 ft Z> h 

[0102] 

Sfg©^ 3RD-'%S¥^ra©B3^ 

JS«©g 3 Af £ C £ « H D ©&ffi4p^©ffiT i tB 

l»bT<,>.&a>fcLft&c>. ft~>r. fgffi^©-^- £ £ 

a 1 3(C7^-T2 3 4 AWflf^JCO^rfflil^O/i:. 122 
tt. HDRO'iE^feftfca^/cSftS. «ffi^(C*f 

r-5»sfg©S3*aurc»€,„ HBg. ^£e^«HDJS 

ffi©S3KlSJ:bt«JLri''-5). ^S^i5a©S?i)!)S 
-^K^©K^(C*Sifg^Lr. Pearson©tBM^ 
i[/Tr=-. 75. p<0. 0 0 0 1 W?>nfc. J 

(R J = 0. 61. F=121. 45), C©Ci|J% 
J5«S3 <!: ©P^«cfc •) *iX7C©BS^^* £ C <t 

[0103] £ffii|i{i©^ffi©i&£|i8a-r£^ffi^{C 
«tB^^c^Sii*5*f3 . ISJg^ 15-7 SlrV&zm&O 
3 7 -5 2 ^{4^© h >; * * U ^ KJ5«r ^ 
■C*-5 (HDifi*8 8%) . C©g5H-C«. 
iSffl©g3i©K©— ^K^tc. J: 0 m,X7C<Dtm t 

i§i DjSatt*w e>ft tc ^*g®mu<D® o © 9 5 %m« 

Era1*5± 1 8^-C#^3ft/c„ 5 2 ^l{i©©ffi© 5 ^ 3 

7-c«. ^ffi^^tc*t-rssms3©M^w^^©* 

#©S£3 Gj!p&< (r=-0. 40, p<. 000 
1 ) . C©Citt5 2 #{iJit±©Sffl* J ^ < ©^ ffi&tti 
^Kf^LtU4Ci5:mLri,^„ C©tgA0fcf5H 

K?:*j*S%<t'©t h{c<tor(*tB^-c^ct\ 

[0104] H D«£^©JJ!|a$gK:a-3< t 1 -^©^ 
4ffl»t4Ci©T#43 7-5 2Sfflm(5©ffiffi{C* 
* 1 7 8 ffi^-eti. ^^fflUlilS^frrtC «t 0 . C ©S5H© 
S«©fi 3 i U tc&fcttic ft? £ iljtaiK©* ^ ft 
J^mA^3ft/c (p<0. 05) . e$HS£fefe©HD 
S(S^+ t'J7 S£&H3£&3ft© fc© J: 0 

cc ^7=-^ffi*. x«ejix«5<:^!i5©^e^cffl 

^ t b fcis^©c >f ft (efc i >r iE^: & ^fe<*±©Sffl 

©fi 3 i t ©IffltC «f»1 6 ©ra^4>^t±l 3 titcfr 

[0105] ±^©^i«tt i 3iffltc«fcor*w*ffl®«:'a 
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* [0107] 
K?U<Dg S : 2 0 



1 



*10 



«4>« : -#« 



[0 10 6] 
[ffijftjfg] 

OGCGGCAGAC CGCCATGGCG 20 

[0108] 1S^J## : 2 
g^'JCDgi? : 1 7 
KWI©32 : KM 

AATACGACTC ACTATAG 17 
[0109] IS^J#-^ : 3 
Iffinj<Dg§ : 3 0 

ATGAAGGCCT TCGAGTCCCT CAAGTCCTTC 30 

[0110] ie?ij## : 4 
B^JOSS : 3 0 
IE*J<DM : KM * 

AAACTCACGG TCGCTGCAGC GGCTCCTCAC 30 

[0111] E5»i»# : 5 20^ h^oy- : gMtf 

B3?iJ<DgS : 1 0 3 6 6 ISRJ<D49tt : 

S3*J<DM : KM (A) «F»«r^ri2#: CDS 

♦ (B) SftfiS: 316.. 9748 
TTGCTGrCTG AGGCAGAACC TGCGGGGGGA GGGGCGGGCT GG1TCCCTGG CCAGCCATTG 60 
QCAGACTCCG CAGGCTAGGG CTGTCAATCA TGCTGGCCGG CGTGGCCCCG OCTCCGCCGG 120 
GGCGGCCCCG COCCGCCGG CGCACGTCTG GG/£GCAAGG CGCCCTGGGG GCTGCCGGGA 180 
CGGCTCCAAG ATOGACGCCC GCTCAGGTTC TGCTTTTACC TGCGGCGCAG AGCCCCATTC 240 
ATTGCCCCGG TGCTCAGCGG CGCOGCGAGT CGGCCCGAGG CCTCCGGGGA CTGCCCTGCC 300 
GGGCGGGAGA CCGCC ATG GCG ACC CTG GAA AAG CTG ATG AAG GCC TTC GAG 351 

Met Ala Thr Leu Glu Lys Leu Met Lys Ala Phe Glu 
1 5 10 

TCC CTC AAG TCC TTC CAG CAG CAG CAG CAG CAG GAG CAG CAG CAG CAG 399 
Ser Leu Lys Ser Phe Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 

15 20 25 

CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAA CAG CCG CCA CCG OCG 447 
Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Pro Pro Pro Pro 

30 35 40 

CCG COG CCG CCG CCG OCT CCT CAG CTT CCT CAG CCG CCG CCG CAG OCA 495 
Pro Pro Pro Pro Pro Pro Pro Gin Leu Pro Gin Pro Pro Pro Gin Ala 
45 50 55 60 

CAG CCG CTG CTG CCT CAG CCG CAG CCG CCC CCG CCG CCG CCC CCG CCG 543 
an Pro Leu Leu Pro Gin Pro Gin Pro Pro Pro Pro Pro Pro Pro Pro 

65 70 75 

OCA CCC GGC CCG GCT CTC GCT GAG GAG CCG CTG CAC CGA CCA AAG AAA 591 
Pro Pro Gly Pro Ala Val Ala Glu Glu Pro Leu His Arq Pro Lys Lys 

80 85 90 

GAA CTT TCA GCT ACC AAG AAA GAC CGT CTG AAT CAT TGT CTG ACA ATA 639 
Glu Leu Ser Ala Thr Lys Lys Asp Arq Val Asn His Cys Leu Thr lie 

95 100 105 

TGT GAA AAC ATA GTG GCA CAG TCT CTC AGA AAT TCT CCA GAA TTT CAG 687 
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39 40 
Cys Glu Asn lie Val Ala Gin Ser Val Arq Asn Ser Pro Glu Phe an 

110 115 120 

AAA CTT CTG GGC ATC OCT ATG GAA CTT TTT CTG CTC TGC ACT GAT GAC 735 
Lys Laj Leu Gly He Ala Met Glu Leu Phe Leu Leu Cys Ser Asp Asp 
125 130 135 140 

GCA CfiC TCA GAT GTC AGC ATG CTG GCT GAC GAA TGC CTC AAC AAA GTT 783 
Ala Glu Ser Asp Val Arq Met Val Ala Asp Glu Cys Leu Asn Lys Val 

145 150 155 

ATC AAA GCT TTG ATG CAT TCT AAT CTT CCA AX TTA CAG CTC GAG CTC 831 
He Lys Ala Leu Met Asp Ser Asn Leu Pro Arq Leu Gin Leu Glu Leu 

160 165 170 

TAT AAC GAA ATT AAA AAG AAT GOT GCC CCT CGG ACT TTG CGT GCT GCC 879 
Tyr Lys Glu He Lys Lys Asn Gly Ala Pro Arq Ser Leu Arq Ala Ala 

175 180 185 

CTG TGG AGG TTT GCT CAG CTG GCT CAC CTG GTT CGG CCT CAG AAA TGC 927 
Leu Trp Arq Phe Ala Glu Leu Ala His Leu Val Arq Pro Gin Lys Cys 

190 195 200 

AGG CCT TAC CTG CTC AAC CTT CTG CCG TGC CTG ACT CGA ACA AGC AAG 975 
Arq Pro Tyr Leu Val Asn Leu Leu Pro Cys Leu Thr Arq Thr Ser Lys 
205 210 215 220 

AGA CCC GAA GAA TCA GTC CAG GAG ACC TTG GCT GCA GCT GTT CCC AAA 1023 
Arq Pro Glu Glu Ser Val Gin Glu Thr Leu Ala Ala Ala Val Pro Lys 

225 230 235 

ATT ATC GCT TCT TTT GGC AAT TTT GCA AAT CAC AAT GAA ATT AAG GTT 1071 
He Met Ala Ser Phe Gly Asn Phe Ala Asn Asp Asn Glu He Lys Val 

240 245 2 50 

TTG TTA AAG GCC TTC ATA GOG AAC CTG AAG TCA AGC TCC CCC ACC ATT 1119 
Leu Leu Lys Ala Phe He Ala Asn Leu Lys Ser Ser Ser Pro Thr He 

255 260 265 

CGG CGG ACA GCG GCT GGA TCA GCA CTG AGC ATC TGC CAG CAC TCA AGA 1167 
Arq Arq Thr Ala Ala Gly Ser Ala Val Ser He Cys Gin His Ser Arq 

270 275 280 

AGG ACA CAA TAT TTC TAT ACT TGG CTA CTA AAT GTC CTC TTA GGC TTA 1215 
Arq Thr Gin Tyr Phe Tyr Ser Trp Leu Leu Asn Val Leu Leu Gly Leu 
285 290 295 300 

CTC GTT CCT GTC GAG GAT GAA CAC TCC ACT CTG CTG ATT CTT GGC CTC 1263 
Leu Val Pro Val Glu Asp Glu His Ser Thr Leu Leu He Leu Gly Val 

305 310 315 

CTG CTC ACC CTG AGG TAT TTG GTG CCC TTG CTG CAG CAG CAG GTC AAG 1311 
Leu Leu Thr Leu Arq Tyr Leu Val Pro Leu Leu Gin Gin Gin Val Lys 

320 325 330 

CAC ACA AGC CTG AAA GGC AGC TTC GGA GTG PCA AGG AAA GAA ATG GAA 1359 
Asp Thr Ser Leu Lys Gly Ser Phe Gly Val Thr Arq Lys Glu Met Glu 

335 340 345 

GTC TCT CCT TCT GCA CAC CAG CTT CTC CAG GTT TAT GAA CTG ACG TTA 1407 
Val Ser Pro Ser Ala Glu Gin Leu Val Gin Val Tyr Glu Leu Thr Leu 

350 355 360 

CAT CAT ACA CAG CAC CAA GAC CAC AAT GTT GTG ACC GGA GCC CTG CAG 1455 
His His Thr Gin His Gin Asp His Asn Val Val Thr Gly Ala Leu Glu 
365 370 375 380 
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41 42 
CTG TTG CAG CAG CTC TTC AGA ACG CCT CCA CCC GAG CTT CTG CAA ACC 1503 
Leu Leu Gin Gin Leu Fhe Arq Thr Pro Pro Pro Glu Leu Leu Gin Thr 

385 390 395 

CTG ACC GCA GTC GGG GGC ATT GGG CAG CTC ACC GCT GCT AAG GAG GAG 1551 
Leu Thr Ala Val Gly Gly He Gly Gin Leu Thr Ala Ala Lys Glu Glu 

400 405 410 

TCT GGT GGC CGA AGC OGT AGT GGG ACT ATT GTG GAA CTT ATA GCT GGA 1599 
Ser Gly Gly Arq Ser Arq Ser Gly Ser He Val Glu Leu He Ala Gly 

415 420 42 5 

GGG GGT TCC TCA TGC AGC CCT GTC CTT TCA AGA AAA CAA AAA GGC AAA 1647 
ay Gly Ser Ser Cys Ser Pro Val Leu Ser Arq Lys Gin Lys Gly Lys 

430 435 440 

GTG CTC TTA GGA GAA CAA GAA GCC TTG GAG GAT CAC TCT GAA TCG AGA 1695 
Val Leu Leu Gly Glu Glu Glu Ala Leu Glu Asp Asp Ser Glu Ser Arq 
445 450 455 460 

TCG GAT GTC AGC AGC TCT GGC TTA ACA GCC TCA GTG AAG GAT GAG ATC 1743 
Ser Asp Val Ser Ser Ser Ala Leu Thr Ala Ser Val Lys Asp Glu He 

465 470 475 

AGT GGA GAG CTG GCT GCT TCT TCA GGG CTT TCC ACT CCA GGG TCA GCA 1791 
Ser Gly Glu Leu Ala Ala Ser Ser Gly Val Ser Thr Pro Gly Ser Ala 

480 485 490 

GGT CAT GAC ATC ATC ACA GAA CAG CCA CGG TCA CAC CAC ACA CTG CAG 1839 
Oy His Asp He lie Thr Glu Gin Pro Arq Ser Gin His Thr Leu an 

495 500 505 

GCG GAC TCA CTG GAT CTG GGC AGC TGT GAC TTG ACA AGC TCT GCC ACT 1887 
Ala Asp Ser Leu Asp Leu Ala Ser Cys Asp Leu Thr Ser Ser Ala Thr 

510 515 520 

GAT GGG GAT GAG GAG GAT ATC TTG AGC CAC AGC TCC AGC CAG GTC AGC 1935 
Asp Gly Asp Glu Glu Asp He Leu Ser His Ser Ser Ser Gin Val Ser 
52 5 530 535 540 

GCC CTC CCA TCT GAC GCT GGC ATG GAC CTG AAT GAT GGG ACC CAG GCC 1983 
Ala Val Pro Ser Asp Pro Ala Met Asp Leu Asn Asp Gly Thr Gin Ala 

545 550 555 

TCG TCG CCC ATC AGC GAC AGC TCC CAG ACC ACC ACC GAA GGG CCT CAT 2031 
Ser Ser Pro lie Ser Asp Ser Ser Gin Thr Thr Thr Glu Gly Pro Asp 

560 565 570 

TCA GCT GTT ACC CCT TCA GAC AGT TCT GAA ATT GTG TTA GAC GGT ACC 2079 
Ser Ala Val Thr Pro Ser Asp Ser Ser Glu He Val Leu Asp Gly Thr 

575 580 585 

CAC AAC CAG TAT TTG GGC CTG CAG ATT GGA CAG CCC CAG GAT GAA GAT 2127 
Asp Asn Gin Tyr Leu ay Leu Gin He Gly Gin Pro Gin Asp Glu Asp 

590 595 600 

GAG GAA GCC ACA GGT ATT CTT CCT GAT GAA GCC TCG GAG GCC TTC AGG 2175 
au Glu Ala Thr Gly He Leu Pro Asp Glu Ala Ser Glu Ala Phe Arq 
605 610 615 620 

AAC TCT TCC ATG GCC CTT CAA CAG GCA CAT TTA TTG AAA AAC ATG AGT 2223 
Asn Ser Ser Met Ala Leu Gin Gin Ala His Leu Leu Lys Asn Met Ser 

625 630 635 

CAC TGC AGG CAG CCT TCT GAC AGC AGT GTT GAT AAA TTT GTG TTG AGA 2271 
His Cys Arq Gin Pro Ser Asp Ser Ser Val Asp Lys Phe Val Leu Arq 
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43 44 
640 645 650 

GAT GAA OCT ACT GAA CCG OCT GAT CM GAA AAC AAG CCT TGC COC ATC 2319 
Asp Glu Ala Thr Glu Pro Gly Asp Gin Glu Asn Lys Pro Cys Arq lie 

655 660 665 

AAA GGT GAC ATT GGA CAG TCC ACT GAT GAT GAC TOT GCA CCT CTT GTC 2367 
Lys Gly Asp lie Gly Gin Ser Thr Asp Asp Asp Ser Ala Pro Leu Val 

670 675 680 

CAT TCT GTC CGC CTT TTA TCT GCT TCG TTT TTG CTA ACA GGG GGA AAA 2415 
His Ser Val Arq Leu Leu Ser Ala Ser Phe Leu Leu Thr Gly Gly Lys 
685 690 695 700 

AAT GTG CTG GTT CCG CAC AGG GAT GTG AGG GTC AGC GTG AAG GCC CTG 2463 
Asn Val Leu Val Pro Asp Arq Asp Val Arq Val Ser Val Lys Ala Leu 

705 710 715 

OCC CTC AGC TCT CTG GCA GCA GCT GTG GCC CTC CAC CCG GAA TCT TTC 2511 
Ala Leu Ser Cys Val Gly Ala Ala Val Ala Leu His Pro Glu Ser Hie 

720 725 730 

TTC AGC AAA CTC TAT AAA GTT CCT CTT GAC ACC ACG GAA TAC CCT GAG 2559 
Phe Ser Lys Leu Tyr Lys Val Pro Leu Asp Thr Thr Glu Tyr Pro Glu 

735 740 745 

GAA CfiC TAT GTC TCA GAC ATC TTG AAC TAC ATC GAT CAT GGA GAC CCA 2607 
Glu Gin Tyr Val Ser Asp lie Leu Asn Tyr lie Asp His Gly Asp Pro 

750 755 760 

CAG GTT CGA GGA GCC ACT GCC ATT CTC TCT GGG ACC CTC ATC TGC TCC 2655 
Gin Val Arq Gly Ala Thr Ala lie Leu Cys Gly Thr Leu He Cys Ser 
765 770 775 780 

ATC CTC AGC AGG TCC CGC TTC CAC GTG GGA GAT TGG ATG GGC ACC ATT 2703 
He Leu Ser Arq Ser Arq Phe His Val Gly Asp Trp Met Gly Thr lie 

785 790 795 

AGA ACC CTC ACA GGA AAT ACA TTT TCT TTG GCG GAT TGC ATT CCT TTG 2751 
Arq Thr Leu Thr Gly Asn Thr Phe Ser Leu Ala Asp Cys He Pro Leu 

800 805 810 

CTG CGG AAA ACA CTG AAG GAT GAG TCT TCT GTT ACT TGC AAG TTA GCT 2799 
Leu Arq Lys Thr Leu Lys Asp Glu Ser Ser Val Thr Cys Lys Leu Ala 

815 820 825 

TCT ACA GCT GTG AGG AAC TCT GTC ATG ACT CTC TGC AGC AGC AGC TAC 2847 
Cys Thr Ala Val Arq Asn Cys Val Met Ser Leu Cys Ser Ser Ser Tyr 

830 835 840 

ACT GAG TTA GGA CTG CAG CTG ATC ATC GAT CTG CTG ACT CTG AGG AAC 2895 
Ser Glu Leu Gly Leu Gin Leu lie He Asp Val Leu Thr Leu Arq Asn 
845 850 855 860 

ACT TCC TAT TGG CTG GTG AGG ACA GAG CTT CTG GAA ACC CTT GCA GAG 2943 
Ser Ser Tyr Trp Leu Val Arq Thr Glu Leu Leu Glu Thr Leu Ala Glu 

865 870 875 

ATT GflC TTC AGG CTG GTG AGC TTT TTG GAG GCA AAA GCA GAA AAC TTA 2991 
He Asp Phe Arq Leu Val Ser Phe Leu Glu Ala Lys Ala Glu Asn Leu 

880 885 890 

CAC AGA GGG GCT CAT CAT TAT ACA GGG CTT TTA AAA CTG CAA GAA CGA 3039 
His Arq Gly Ala His His Tyr Thr Gly Leu Leu Lys Leu Gin Glu Arq 

895 900 905 

GTG CTC AAT AAT GTT GTC ATC CAT TTG CTT GGA GAT GAA GAC CCC AGG 3087 
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45 46 
Val Leu Asn Asn Val Val He His Leu Leu Gly Asp Glu Asp Pro Arq 

910 915 920 

GTG CGA CAT GTT GCC GCA GCA TCA CTA ATT AuG CTT GTC CCA AAG CTG 3135 
Val Arq His Val Ala Ala Ala Ser Leu He Arq Leu Val Pro Lys Leu 
925 930 935 940 

TTT TAT AAA TGT GAC CAA GGA CAA OCT GAT CCA GTA GTG GCC GTG GCA 3183 
Phe Tyr Lys Cys Asp Gin Gly Gin Ala Asp Pro Val Val Ala Val Ala 

945 950 955 

AGA GAT CM AGC ACT GTT TAG CTG AM CTT CTC ATG CAT GAG ACG CAG 3231 
Arq Asp Gin Ser Ser Val Tyr Leu Lys Leu Leu Met His Glu Thr an 

960 965 970 

OCT CCA TCT CAT TTC TCC GTC AGC ACA ATA ACC AGA ATA TAT AGA GGC 3279 
Pro Pro Ser His Phe Ser Val Ser Thr He Thr Arq He Tyr Arq Gly 

975 980 985 

TAT AAC CTA CTA CCA AGC ATA ACA GAC GTC ACT ATG GM MT MC CTT 3327 
Tyr Asn Leu Leu Pro Ser He Thr Asp Val Thr Met Glu Asn Asn Leu 

990 995 1000 

TCA AGA GTT ATT GCA GCA GTT TCT CAT CAA CTA ATC ACA TCA ACC ACC 3375 
Ser Arq Val He Ala Ala Val Ser His Glu Leu He Thr Ser Thr Thr 
1005 1010 1015 1020 

ACA GCA CTC ACA TTT GGA TGC TGT GM GCT TTG TGT CTT CTT TCC ACT 3423 
Arq Ala Leu Thr Phe Gly Cys Cys Glu Ala Leu Cys Leu Leu Ser Thr 

1025 1030 1035 

GCC TTC CCA GTT TGC ATT TGG ACT TTA GGT TGG QAC TGT GGA GTG CCT 3471 
Ala Phe Pro Val Cys lie Trp Ser Leu Gly Trp His Cys Gly Val Pro 

1040 1045 1050 

CCA CTG ACT GCC TCA GAT GAG TCT AGG MG AX TGT ACC GTT GGG ATG 3519 
Pro Leu Ser Ala Ser Asp Glu Ser Arq Lys Ser Cys Thr Val Gly Met 

1055 1060 1065 

GCC ACA ATG ATT CTG ACC CTG CTC TCG TCA GCT TGG TTC CCA TTG GAT 3567 
Ala Thr Met He Leu Thr Leu Leu Ser Ser Ala Trp Phe Pro Leu Asp 

1070 1075 1080 

CTC TCA GCC CAT CM CAT GCT TTG ATT TTG GCC GGA MC TTG CTT GCA 3615 
Leu Ser Ala His Gin Asp Ala Leu He Leu Ala Gly Asn Leu Leu Ala 
1085 1090 1095 1100 

GCC ACT GCT CCC AM TCT CTG AGA AGT TCA TGG GCC TCT GM GM GM 3663 
Ala Ser Ala Pro Lys Ser Leu Arq Ser Ser Trp Ala Ser Glu Glu Glu 

1105 1110 1115 

GCC AAC CCA GCA GCC ACC AAG CM GAG GAG GTC TGG CCA GCC CTG GGG 3711 
Ala Asn Pro Ala Ala Thr Lys Gin Glu Glu Val Trp Pro Ala Leu ay 

1120 112 5 1130 

CAC CGG GCC CTG GTG CCC ATG GTG GAG CAG CTC TTC TCT CAC CTG CTG 3759 
Asp Arq Ala Leu Val Pro Met Val Glu Gin Leu Phe Ser His Leu Leu 

1135 1140 1145 

MG GTG ATT MC ATT TGT GCC CAC GTC CTG GAT CAC GTG GCT CCT GGA 3807 
Lys Val He Asn He Cys Ala His Val Leu Asp Asp Val Ala Pro ay 

. 1150 1155 1160 

GCC GCA ATA MG GCA GCC TTG CCT TCT CTA ACA AAC CCC CCT TCT CTA 3855 
Pro Ala He Lys Ala Ala Leu Pro Ser Leu Thr Asn Pro Pro Ser Leu 
1165 1170 1175 1180 
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47 48 
ACT CCC ATC CGA CGA AAG COG AAC GAG AAA CAA CCA GGA GAA CAA OCA 
Ser Pro lie Arg Arg Lys Gly Lys Glu Lys Glu Pro Gly Glu Gin Ala 

1185 1190 1195 

TCT GTA CCG TTG AGT GCC AAG AAA GGC ACT GAG GCC AGT GCA GCT TCT 
Ser Val Pro Leu Ser Pro Lys Lys Gly Ser Glu Ala Ser Ala Ala Ser 

1200 1205 1210 

AGA CAA TCT GAT ACC TCA GGT CCT GTT ACA ACA ACT AAA TCC TCA TCA 
Arq Gin Ser Asp Thr Ser Gly Pro Val Thr Thr Ser Lys Ser Ser Ser 

1215 1220 1225 

CTG GGG AGT TTC TAT CAT CTT CCT TCA TAC CTC AGA CTG CAT GAT GTC 
Leu Gly Ser Phe Tyr His Leu Pro Ser Tyr Leu Arq Leu His Asp Val 

1230 1235 1240 

CTG AAA GCT ACA CAC GCT AAC TAC AAG GTC ACG CTG GAT CTT CAG AAC 
Leu Lys Ala Thr His Ala Asn Tyr Lys Val Thr Leu Asp Leu Gin Asn 
1245 1250 1255 1260 

ACC ACG GAA AAG TTT GGA GGG TTT CTC CGC TCA GCC TTG GAT GTT CTT 
Ser Thr Glu Lys Phe Gly Gly Phe Leu Arq Ser Ala Leu Asp Val Leu 

1265 1270 1275 

TCT CAC ATA CTA GAG CTG GGC ACA CTG CAG GAC ATT GGG AAG TCT GTT 
Ser Gin lie Leu Glu Leu Ala Thr Leu Gin Asp lie Gly Lys Cys Val 

1280 1285 1290 

CAA CAC ATC CTA GGA TAC CTG AAA TCC TGC TTT AGT CGA GAA CCA ATG 
Glu Glu lie Leu Gly Tyr Leu Lys Ser Cys Phe Ser Arq Glu Pro Met 

1295 1300 1305 

ATG GCA ACT CTT TCT GTT CAA CAA TTG TTG AAG ACT CTC TTT GGC ACA 
Met Ala Thr Val Cys Val Gin Gin Leu Leu Lys Thr Leu Phe Gly Thr 

1310 1315 1320 

AAC TTG GCC TCC CAG TTT GAT GGC TTA TCT TCC AAC CCC AGC AAG TCA 
Asn Leu Ala Ser Gin Phe Asp Gly Leu Ser Ser Asn Pro Ser Lys Ser 
1325 1330 1335 1340 

CAA GGC CGA GCA CAG GGC CTT GGC TCC TCC ACT CTG AGG CCA GGC TTG 
Gin Gly Arq Ala Gin Arq Leu Gly Ser Ser Ser Val Arq Pro Gly Leu 

1345 1350 1355 

TAC CAC TAC TGC TTC ATG GGC CCG TAC ACC CAC TTC ACC CAG GCC CTC 
Tyr His Tyr Cys Phe Met Ala Pro Tyr Thr His Phe Thr Gin Ala Leu 

1360 1365 1370 

GCT GAC GCC AGC CTG AGG AAC ATG CTG CAG CCG GAC CAG GAG AAC GAC 
Ala Asp Ala Ser Leu Arq Asn Met Val Gin Ala Glu Gin Glu Asn Asp 

1375 1380 1385 

ACC TCG GGA TGG TTT GAT GTC CTC CAG AAA GTG TCT ACC CAG TTG AAG 
Thr Ser Gly Trp Phe Asp Val Leu Gin Lys Val Ser Thr Gin Leu Lys 

1390 1395 1400 

ACA AAC CTC ACG AGT GTC ACA AAG AAC CGT GCA GAT AAG AAT GCT ATT 
Thr Asn Leu Thr Ser Val Thr Lys Asn Arq Ala Asp Lys Asn Ala He 
1405 1410 1415 1420 

CAT AAT CAC ATT CCT TTG TTT GAA CCT CTT CTT ATA AAA GCT TTA AAA 
His Asn His He Arg Leu Phe Glu Pro Leu Val He Lys Ala Leu Lys 

1425 1430 1435 

CAG TAC ACC ACT ACA ACA TGT CTG CAG TTA CAG AAC CAG GTT TTA CAT 
Gin Tyr Thr Thr Thr Thr Cys Val Gin Leu Gin Lys Gin Val Leu Asp 
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49 50 
1440 1445 1450 

TTG CTG GCG CAG CTG GTT CAG TTA CGG GTT MT TAC TGT CTT CTG GAT 4719 
Leu Leu Ala Gin Leu Val Gin Leu Arq Val Asn Tyr Cys Leu Leu Asp 

1455 1460 1465 

TCA GAT CAG GTG TTT ATT GGC TTT CTA TTG AAA CAG TTT GAA TAC ATT 4767 
Ser Asp Gin Val Phe He Gly Phe Val Leu Lys Gin Phe Glu Tyr He 

1470 1475 1480 

CAA CTG GGC CAG TTC AGG GAA TCA GAG GCA ATC ATT CCA AAC ATC TTT 4815 
Glu Val Gly Gin Phe Arq Glu Ser Glu Ala He He Pro Asn lie Phe 
1485 1490 1495 1500 

TTC TTC TTG CTA TTA CTA TCT TAT GAA CGC TAT CAT TCA AAA CAG ATC 4863 
Phe Phe Leu Val Leu Leu Ser Tyr Glu Arq Tyr His Ser Lys Gin He 

1505 1510 1515 

ATT GCA ATT CCT AAA ATC ATT CAG CTC TGT GAT GGC ATC ATG GCC ACT 4911 
He Gly lie Pro Lys He He Gin Leu Cys Asp Gly He Met Ala Ser 

1520 1525 1530 

GGA AGG AAG GCT GTG ACA CAT GCC ATA CCG GCT CTG CAG CCC ATA GTC 4959 
Cly Arq Lys Ala Val "Thr His Ala He Pro Ala Leu Gin Pro He Val 

1535 1540 1545 

(AC GAC CTC TTT GTA TTA AGA GGA ACA AAT AAA GCT GAT GCA GGA AAA 5007 
His Asp Leu Phe Val Leu Arq Gly Thr Asn Lys Ala Asp Ala Gly Lys 

1550 1555 1560 

GAG CTT GAA ACC CAA AAA GAG CTG CTG CTG TCA ATG TTA CTG AGA CTC 5055 
Glu Leu Glu Thr Gin Lys Glu Val Val Val Ser Met Leu Leu Arq Leu 
1565 1570 1575 1580 

ATC CAG TAC CAT CAG CTG TTG GAG ATG TTC ATT CTT CTC CTG CAG CAG 5103 
He Gin Tyr His Gin Val Leu Glu Met Phe lie Leu Val Leu Gin Gin 

1585 1590 1595 

TGC CAC AAG GAG AAT CAA GAC AAG TGG AAG GGA CTC TCT CGA CAG ATA 5151 
Cys His Lys Glu Asn Glu Asp Lys Trp Lys Arq Leu Ser Arq Gin lie 

1600 1605 1610 

GCT GAG ATC ATC CTC OCA ATG TTA GCC AAA CAG CAG ATG CAC ATT CAC 5199 
Ala Asp lie He Leu Pro Met Leu Ala Lys Gin Gin Met His lie Asp 

1615 1620 1625 

TCT CAT GAA GCC CTT GGA CTG TTA AAT ACA TTA TTT GAG ATT TTG GCC 5247 
Ser His Glu Ala Leu Cly Val Leu Asn Thr Leu Phe Glu He Leu Ala 

1630 1635 1640 

GCT TCC TCC CTC CCT CCG CTA GAC ATG CTT TTA CGG ACT ATG TTC GTC 5295 
Pro Ser Ser Leu Arq Pro Val Asp Met Leu Leu Arq Ser Met Phe Val 
1645 1650 1655 1660 

ACT CCA AAC ACA ATG GCG TCC CTG AGC ACT CTT CAA CTG TGG ATA TCG 5343 
Thr Pro Asn Thr Met Ala Ser Val Ser Thr Val Gin Leu Trp He Ser 

1665 1670 1675 

GGA ATT CTG GCC ATT TTG AGG GTT CTG ATT TCC CAG TCA ACT GAA CAT 5391 
Gly He Leu Ala He Leu Arq Val Leu He Ser Gin Ser Thr Glu Asp 

1680 1685 1690 

ATT CTT CTT TCT CCT ATT CAG GAG CTC TCC TTC TCT CCG TAT TTA ATC 5439 
lie Val Leu Ser Arq He Gin Glu Leu Ser Phe Ser Pro Tyr Leu He 

1695 1700 1705 

TCC TCT ACA CTA ATT AAT AGG TTA AGA GAT GGG CAC ACT ACT TCA ACG 5487 
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51 52 
Ser Cys Thr Val He Asn Arq Leu Arq Asp Gly Asp Ser Thr Ser Thr 

1710 1715 1720 

CTA GAA CAA CAC ACT GAA GOG AAA CAA ATA AAG AAT TTG CCA GAA GAA 5535 
Leu Glu Glu His Ser Glu Gly Lys Gin He Lys Asn Leu Pro Glu Glu 
1725 1730 1735 1740 

ACA TTT TCA AGG TTT CTA TTA CAA CTG GTT GGT ATT CTT TTA GAA CAC 5583 
Thr Phe Ser Arq Phe Leu Leu Gin Leu Val Gly He Leu Leu Glu Asp 

1745 1750 1755 

ATT GTT ACA AAA CAG CTG AAG GTG GAA ATG ACT GAG CAG CAA CAT ACT 5631 
He Val Thr Lys Gin Leu Lys Val Glu Met Ser Glu Gin Gin His Thr 

1760 1765 1770 

TTC TAT TGC CAG GAA CTA GGC ACA CTG CTA ATG TGT CTG ATC CAC ATC 5679 
Phe Tyr Cys Gin Glu Leu Gly Thr Leu Leu Met Cys Leu lie His He 

1775 1780 1785 

TTC AAG TCT GGA ATG TTC CGG AGA ATC ACA GCA GCT GCC ACT AGG CTG 5727 
Phe Lys Ser Gly Met Phe Arq Arq He Thr Ala Ala Ala Thr Arq Leu 

1790 1795 1800 

TTC CGC ACT GAT GGC TGT GGC GGC ACT TTC TAC ACC CTG GAC AGC TTG 5775 
Phe Arq Ser Asp Gly Cys Gly Gly Ser Phe Tyr Thr Leu Asp Ser Leu 
1805 1810 1815 1820 

AAC TTG CGG GCT CGT TCC ATG ATC ACC ACC CAC CCG GCC CTG GTG CTG 5823 
Asn Leu Arq Ala Arq Ser Met He Thr Thr His Pro Ala Leu Val Leu 

1825 1830 1835 

CTC TGG TGT CAG ATA CTG CTG CTT GTC AAC CAC ACC GAC TAC CGC TGG 5871 
Leu Trp Cys Gin He Leu Leu Leu Val Asn His Thr Asp Tyr Arq Trp 

1840 1845 1850 

TGG GCA GAA GTG CAG CAG ACC CCG AAA AGA CAC ACT CTG TCC AGC ACA 5919 
Trp Ala Glu Val Gin Gin Thr Pro Lys Arq His Ser Leu Ser Ser Thr 

1855 1860 1865 

AAG TTA CTT ACT CCC CAG ATG TCT GGA GAA GAG GAG GAT TCT GAC TTG 5967 
Lys Leu Leu Ser Pro Gin Met Ser Gly Glu Glu Glu Asp Ser Asp Leu 

1870 1875 1880 

OCA GCC AAA CTT GGA ATG TGC AAT AGA GAA ATA CTA CGA AGA GGG GCT 6015 
Ala Ala Lys Leu Gly Met Cys Asn Arq Glu He Val Arq Arq Gly Ala 
1885 1890 1895 1900 

CTC ATT CTC TTC TGT GAT TAT GTC TGT CAG AAC CTC CAT GAC TCC GAG 6063 
Leu He Leu Phe Cys Asp Tyr Val Cys Gin Asn Leu His Asp Ser Glu 

1905 1910 1915 

CAC TTA ACG TGG CTC ATT CTA AAT CAC ATT CAA GAT CTG ATC AGC CTT 6111 
His Leu Thr Trp Leu He Val Asn His He Gin Asp Leu He Ser Leu 

1920 1925 1930 

TCC CAC GAG CCT CCA CTA CAG GAC TTC ATC ACT GCC GTT CAT CGG AAC 6159 
Ser His Glu Pro Pro Val Gin Asp Phe He Ser Ala Val His Arq Asn 

1935 1940 1945 

TCT GCT GCC AGC GGC CTG TTC ATC CAG GCA ATT CAC TCT CGT TGT GAA 6207 
Ser Ala Ala Ser Gly Leu Phe He Gin Ala He Gin Ser Arq Cys Glu 

1950 1955 1960 

AAC CTT TCA ACT CCA ACC ATG CTG AAG AAA ACT CTT CAG TGC TTG GAG 6255 
Asn Leu Ser Thr Pro Thr Met Leu Lys Lys Thr Leu Gin Cys Leu Glu 
1965 1370 1975 1380 
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53 54 
OX ATC CAT CTC AGC CAG TCG GGA GCT GTG CTC ACG CTG TAT GTG CAC 6303 
Gly lie His Leu Ser Gin Ser Gly Ala Val Leu Thr Leu Tyr Val Asp 

1985 1990 1995 

AGG CTT CTG TGC ACC CCT TTC CGT CTG CTG GCT CGC ATG GTC GAC ATC 6351 
Arq Leu Leu Cys Thr Pro Phe Arq Val Leu Ala Arq Met Val Asp He 

2000 2005 2010 

CTT GCT TGT CGC CGG GTA GM ATG CTT CTG GCT GCA AAT TTA CAG AGC 6399 
Leu Ala Cys Arq Arq Val Glu Wet Leu Leu Ala Ala Asn Leu Gin Ser 

2015 2020 2025 

AGC ATG GCC CAG TTG CCA ATG GAA GAA CTC AAC ACA ATC CAG GAA TAC 6447 
Ser Met Ala Gin Leu Pro Met Glu Glu Leu Asn Arq He Gin Glu Tyr 

2030 2035 2040 

CTT CAG AGC AGC GGG CTC GCT CAG AGA CAC CAA AGG CTC TAT TCC CTG 6495 
Leu Gin Ser Ser Gly Leu Ala Gin Arq His Gin Arq Leu Tyr Ser Leu 
2045 2050 2055 2060 

CTG GAC AGG TTT CGT CTC TCC ACC ATG CAA GAC TCA CTT ACT CCC TCT 6543 
Leu Asp Arq Phe Arq Leu Ser Thr Met Gin Asp Ser Leu Ser Pro Ser 

2065 2070 2075 

CCT CCA GTC TCT TCC CAC COG CTG GAC GGG GAT GGG CAC GTG TCA CTG 6591 
Pro Pro Val Ser Ser His Pro Leu Asp Gly Asp Gly His Val Ser Leu 

2080 2085 2090 

CAA ACA GTG ACT CCG CAC AAA GAC TGG TAC GTT CAT CTT GTC AAA TCC 6639 
Glu Thr Val Ser Pro Asp Lys Asp Trp Tyr Val His Leu Val Lys Ser 

2095 2100 2105 

CAG TGT TGG ACC AGG TCA GAT TCT GCA CTG CTG GAA GGT GCA GAG CTG 6687 
Gin Cys Trp Thr Arq Ser Asp Ser Ala Leu Leu Glu Gly Ala Glu Leu 

2110 2115 2120 

GTG AAT CGG ATT CCT GCT GAA GAT ATG AAT GCC TTC ATG ATG AAC TCG 6735 
Val Asn Arq He Pro Ala Glu Asp Met Asn Ala Phe Met Met Asn Ser 
2125 2130 2135 2140 

CAG TTC AAC CTA AGC CTG CTA GCT CCA TGC TTA AGC CTA GGG ATG ACT 6783 
Glu Phe Asn Leu Ser Leu Leu Ala Pro Cys Leu Ser Leu Gly Met Ser 

2145 2150 2155 

CAA ATT TCT GGT GGC CAG AAG AGT GCC CTT TTT GAA GCA GCC CGT CAG 6831 
Glu He Ser Gly Gly Gin Lys Ser Ala Leu Phe Glu Ala Ala Arq Glu 

2160 2165 2170 

GTG ACT CTG GCC CGT GTG AGC GGC ACC GTG CAG CAG CTC CCT GCT GTC 6879 
Val Thr Leu Ala Arq Val Ser Gly Thr Val Gin Gin Leu Pro Ala Val 

2175 2180 2185 

CAT CAT GTC TTC CAG GCC GAG CTG CCT GCA CAG CCG GCG GCC TAC TGG 6927 
His His Val Phe Gin Pro Glu Leu Pro Ala Glu Pro Ala Ala Tyr Trp 

2190 2195 2200 

AGC AAG TTG AAT GAT CTG TTT GGG GAT GCT GCA CTG TAT CAG TCC CTG 6975 
Ser Lys Leu Asn Asp Leu Phe Gly Asp Ala Ala Leu Tyr Gin Ser Leu 
2205 2210 2215 2220 

GCC ACT CTG GCC CGG GCC CTG GCA CAG TAC CTG CTG GTG GTC TCC AAA 7023 
Pro Thr Leu Ala Arq Ala Leu Ala Gin Tyr Leu Val Val Val Ser Lys 

2225 2230 2235 

CTG CCC AGT CAT TTG CAC CTT CCT CCT GAG AAA CAC AAG GAC ATT GTG 7071 
Leu Pro Ser His Leu His Leu Pro Pro Glu Lys Glu Lys Asp He Val 
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55 56 
2240 2245 2250 

AAA TTC GTG CTG GCA ACC CTT GAG GCC CTG TCC TGG CAT TTG ATC CAT 7119 
Lys Phe Val Val Ala Thr Leu Glu Ala Leu Ser Trp His Leu He His 

2255 2260 2265 

GAG CAG ATC CCG CTG ACT CTG GAT CTC CAG GCA GGG CTG GAC TCC TCC 7167 
Glu Gin lie Pro Leu Ser Leu Asp Leu Gin Ala Gly Leu Asp Cys Cys 

2270 2275 2280 

TGC CTG GCC CTG CAG CTG CCT GGC CTC TGG AX GTG CTC TCC TCC ACA 7215 
Cys Leu Ala Leu Gin Leu Pro Gly Leu Trp Ser Val Val Ser Ser Thr 
2285 2290 2295 2300 

GAG TTT GTG ACC CAC GCC TGC TCC CTC ATC TAC TCT CTG CAC TTC ATC 7263 
. Glu Phe Val Thr His Ala Cys Ser Leu He Tyr Cys Val His Phe lie 

2305 2310 2315 

CTG GAG GCC CTT GCA CTG CAG CCT GGA GAG CAG CTT CTT ACT CCA CAA 73UL 
Leu Glu Ala Val Ala Val Gin Pro Gly Glu Gin Leu Leu Ser Pro Glu 

2320 2325 2330 

AGA AGG ACA AAT ACC OCA AAA GCC ATC AGC GAG GAG GAG GAG GAA CTA 7359 
Arg Arg Thr Asn Thr Pro Lys Ala He Ser Glu Glu Glu Glu Glu Val 

2335 2340 2345 

GAT CCA AAC ACA CAG AAT CCT AAG TAT ATC ACT GCA GCC TCT GAG ATG 7407 
Asp Pro Asn Thr Gin Asn Pro Lys Tyr He Thr Ala Ala Cys Glu Met 

2350 2355 2360 

CTG GCA GAA ATG CTG GAG TCT CTG CAG TCG CTG TTG GCC TTG GGT CAT 7455 
Val Ala Glu Met Val Glu Ser Leu Gin Ser Val Leu Ala Leu Gly ffis 
2365 2370 2375 2380 

AAA AGG AAT AGC GGC CTG CCG CCG TTT CTC ACG CCA TTG CTC AGG AAC 7503 
Lys Arg Asn Ser Gly Val Pro Ala Phe Leu Thr Pro Leu Leu Arg Asn 

2385 2390 2395 

ATC ATC ATC AGC CTG GCC CGC CTG CCC CTT CTC AAC AGC TAC ACA OCT 7551 
He He He Ser Leu Ala Arg Leu Pro Leu Val Asn Ser Tyr Thr Arg 

2400 2405 2410 

CTG CCC CCA CTG GTG TGG AAG CTT GGA TGG TCA CCC AAA CCG GGA OGG 7599 
Val Pro Pro Leu Val Trp Lys Leu Gly Trp Ser Pro Lys Pro Gly ay 

2415 2420 2425 

CAT TTT GGC ACA GCA TTC CCT GAG ATC CCC CTG GAG TTC CTC CAG CAA 7647 
Asp Phe Gly Thr Ala Phe Pro Glu He Pro Val Glu Phe Leu Gin Glu 

2430 2435 2440 

AAG GAA CTC TTT AAG GAG TTC ATC TAC CGC ATC AAC ACA CTA GGC TGG 7695 
Lys Glu Val Phe Lys Glu Phe He Tyr Arg He Asn Thr Leu Gly Trp 
2445 2450 2455 2460 

ACC ACT CCT ACT CAG TTT GAA GAA ACT TGG GCC ACC CTC CTT GCT CTC 7743 
Thr Ser Arg Thr Gin Fhe Glu Glu Thr Trp Ala Thr Leu Leu Gly Val 

2465 2470 2475 

CTG CTG ACG CAG CCC CTC GTG ATG GAG CAG GAG GAG AGC CCA CCA CAA 7791 
Leu Val Thr Gin Pro Leu Val Met Glu Gin Glu Glu Ser Pro Pro Glu 

2480 2485 2490 

GAA GAC ACA GAG AGG ACC CAG ATC AAC CTC CTG GCC CTG CAG GCC ATC 7839 
Glu Asp Thr Glu Arg Thr Gin He Asn Val Leu Ala Val Gin Ala He 

2495 2500 2505 

ACC TCA CTG GTG CTC ACT GCA ATG ACT CTG CCT CTG GCC GGC AAC OCA 7887 
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57 58 
Thr Ser Leu Val Leu Ser Ala Met Thr Val Pro Val Ala Gly Asn Pro 

2510 2515 2520 

OCT GTA AGC TGC TTG GAG CAG CAG CCC CGG AAC AAG CCT CTG AAA GCT 7935 
Ala Val Ser Cys Leu Glu Gin Gin Pro Arq Asn Lys Pro Leu Lys Ala 
2525 2530 2535 2540 

CTC GAC ACC AGG TTT GGG AGG AAG CTG AGC ATT ATC AGA GGG ATT GTG 7983 
Leu Asp Thr Arq Phe Gly Arq Lys Leu Ser He He Arq Gly He Val 

2545 2550 2555 

CAG CM GAG ATT CM GCA ATG GTT TCA MG AGA GAG MT ATT GCC ACC 8031 
Glu Gin Glu He Gin Ala Met Val Ser Lys Arq Glu Asn He Ala Thr 

2560 2565 2570 

CAT CAT TTA TAT CAG GCA TGG GAT CCT CTC CCT TCT CTG TCT CCG GCT 8079 
His His Leu Tyr Gin Ala Trp Asp Pro Val Pro Ser Leu Ser Pro Ala 

2575 2580 2585 

ACT ACA GGT GCC CTC ATC AGC CAC GAG MG CTG CTG CTA CAG ATC MC 8127 
Thr Thr Gly Ala Leu He Ser His Glu Lys Leu Leu Leu Gin He Asn 

2590 2595 2600 

CCC GAG CGG GAG CTG GGG AGC ATG AGC TAC AM CTC GGC CAG CTG TCC 8175 
Pro Glu Arq Glu Leu Gly Ser Met Ser Tyr Lys Leu Gly Gin Val Ser 
2605 2610 2615 2620 

ATA CAC TCC GTG TGG CTG GGG MC AGC ATC ACA CCC CTG AGG GAG CAG 8223 
He His Ser Val Trp Leu Gly Asn Ser He Thr Pro Leu Arq Glu Glu 

2625 2630 2635 

GM TGG GAC GAG GM CAG GAG GAG GAG GCC GAC GCC CCT GCA CCT TCG 8271 
Glu Trp Asp Glu Glu Glu Glu Glu Glu Ala Asp Ala Pro Ala Pro Ser 

2640 2645 2650 

TCA CCA CCC ACG TCT OCA CTC MC TCC AGG AM CAC CGG GCT GGA GTT 8319 
Ser Pro Pro Thr Ser Pro Val Asn Ser Arq Lys His Arq Ala Gly Val 

2655 2660 2665 

CAC ATC CAC TCC TGT TCG CAG TTT TTG CTT GAG TTG TAC AGC CGC TGG 8367 
Asp He His Ser Cys Ser Gin Phe Leu Leu Glu Leu Tyr Ser Arq Trp 

2670 2675 2680 

ATC CTG CCG TCC AGC TCA GCC AGG AGG ACC CCG GCC ATC CTG ATC ACT 8415 
He Leu Pro Ser Ser Ser Ala Arq Arq Thr Pro Ala He Leu lie Ser 
2685 2690 2695 2700 

CAG GTG CTC AGA TCC CTT CTA GTG CTC TCA GAC TTG TTC ACC GAG CGC 8463 
Glu Val Val Arq Ser Leu Leu Val Val Ser Asp Leu Phe Thr Glu Arq 

2705 2710 2715 

MC CAG TTT GAG CTG ATG TAT GTG ACG CTG ACA GM CTG CGA AGG GTG 8511 
Asn Gin Phe Glu Leu Met Tyr Val Thr Leu Thr Glu Leu Arq Arq Val 

2720 2725 2730 

CAC CCT TCA GM GAC GAG ATC CTC GCT CAG TAC CTG GTG CCT GCC ACC 8559 
His Pro Ser Glu Asp Clu He Leu Ala Gin Tyr Leu Val Pro Ala Thr 

2735 2740 2745 

TGC AfiC GCA GCT GCC GTC CTT GGG ATG GAC MG GCC GTG GCG GAG CCT 8607 
Cys Lys Ala Ala Ala Val Leu Gly Met Asp Lys Ala Val Ala Glu Pro 

2750 2755 2760 

GTC AGC CGC CTG CTG GAG AGC ACG CTC AGG AGC AGC CAC CTG CCC AGC 8655 
Val Ser Arq Leu Leu Glu Ser Thr Leu Arq Ser Ser His Leu Pro Ser 
2765 2770 2775 2780 
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59 60 
AGG CTT GGA CCC CTG CAC GGC ATC CTC TAT GTG CTG GAG TGC GAC CTG 8703 
Arq Val Gly Ala Leu His Gly lie Leu Tyr Val Leu Glu Cys Asp Leu 

2785 2790 2795 

CTG GAC GAC ACT GCC MG CAG CTC ATC CCG GTC ATC AGC GAC TAT CTC 8751 
Leu Asp Asp Thr Ala Lys Gin Leu lie Pro Val lie Ser Asp Tyr Leu 

2800 2805 2810 

CTC TCC MC CTG AAA GGG ATC GCC CAC TGC GTG AAC ATT CAC AGC CAG 8799 
Leu Ser Asn Leu Lys Cly lie Ala His Cys Val Asn lie His Ser Cln 

2815 2820. 2825 

CAG CflC GTA CTG GTC ATG TGT GCC ACT GCG TTT TAC CTC ATT GAG AAC 8847 
Gin His Val Leu Val Met Cys Ala Thr Ala Phe Tyr Leu He Glu Asn 

2830 2835 2840 

TAT CCT CTG GAC GTA GGG CGG GAA TTT TCA GCA TCA ATA ATA CAG ATC 8895 
Tyr Pro Leu Asp Val Gly Pro Glu Phe Ser Ala Ser lie He Gin Met 
2845 2850 2855 2860 

TGT GGG GTG ATG CTG TCT GGA AGT GAG GAG TCC ACC CCC TCC ATC ATT 8943 
Cys Gly Val Met Leu Ser Gly Ser Glu Glu Ser Thr Pro Ser He lie 

2865 2870 2875 

TAC CAC TGT GCC CTC AGA GGC CTG GAG CGC CTC CTC CTC TCT GAG CAG 8991 
Tyr His Cys Ala Leu Arq Gly Leu Glu Arq Leu Leu Leu Ser Glu Gin 

2880 2885 2890 

CTC TCC CGC CTG GAT GCA GAA TCG CTG GTC AAG CTC AGT GTG GAC AGA 9039 
Leu Ser Arq Leu Asp Ala Glu Ser Leu Val Lys Leu Ser Val Asp Arq 

2895 2900 2905 

GTG AAC GTG CAC AGC GCG CAC CGG GCC ATG GCG GCT CTG GGC CTG ATG 9087 
Val Asn Val His Ser Pro His Arq Ala Met Ala Ala Leu Gly Leu Met 

2910 2915 2920 

CTC ACC TGC ATG TAC ACA GGA AAG GAG AAA GTC AGT CCG GGT AGA ACT 9135 
Leu Thr Cys Met Tyr Thr Gly Lys Glu Lys Val Ser Pro Gly Arq Thr 
2925 2930 2935 2940 

TCA GAC CCT AAT CCT GCA GGC CCC GAC AGC GAG TCA GTG ATT GTT GCT 9183 
Ser Asp Pro Asn Pro Ala Ala Pro Asp Ser Glu Ser Val He Val Ala 

2945 2950 2955 

ATG GAG CGG GTA TCT GTT CTT TTT GAT AGG ATC AGG AAA GGC TTT CCT 9231 
Met Glu Arq Val Ser Val Leu Phe Asp Arq He Arq Lys Gly Phe Pro 

2960 2965 2970 

TGT GAA GCC AGA GTG GTG GGC AGG ATC CTG CCC CAG TTT CTA GAC CAC 9279 
Cys Glu Ala Arq Val Val Ala Arq He Leu Pro Gin Phe Leu Asp Asp 

2975 2980 2985 

TTC TTC CCA CCC CAG GAC ATC ATG AAC AAA GTC ATC GGA GAG TTT CTG 9327 
Phe Phe Pro Pro Gin Asp He Met Asn Lys Val He Gly Glu Phe Leu 
2990 2995 3000 



TCC AAC CAG CAG 


CCA 


TAC CGC CAG TTC 


ATG 


GCC ACC GTG GTG 


TAT 


AAG 


9375 


Ser Asn Gin Gin 


Pro 


Tyr Pro Gin Phe 


Met 


Ala Thr Val Val 


Tyr 


Lys 




3005 




3010 




3015 




3020 




CTG TTT CAG ACT 


CTG 


CAC AGC ACC GGG 


CAG 


TCG TCC ATG GTC 


CGG 


GAC 


9423 


Val Phe Gin Thr 


Leu 


His Ser Thr Gly 


Gin 


Ser Ser Met Val 


Arq 


Asp 






3025 


3030 


3035 





TGG GTC ATG CTG TCC CTC TOC AAC TTC ACG CAG AGG GCC CCG GTC GCC 9471 
Trp Val Met Leu Ser Leu Ser Asn Phe Thr Gin Arq Ala Pro Val Ala 
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61 62 
3040 3045 3050 

ATG OCC ACG TGG AGC CTC TCC TGC TTC TTT GTC AGC GCG TCC ACC AGC 9519 
Met Ala Thr Trp Ser Leu Ser Cys Phe Phe Val Ser Ala Ser Thr Ser 

3055 3060 3065 

GCG TGG GTC GCG GCG ATC CT C CCA CAT GTC ATC AGC AGG ATG GGC AAG 9567 
Pro Trp Val Ala Ala He Leu Pro His Val lie Ser Arq Met Gly Lys 

3070 3075 3080 

CTG GflG CAG GTC GAC CTG AAC CTT TTC TGC CTG GTC GCC ACA GAC TTC 9615 
Leu Glu Gin Val Asp Val Asn Leu Phe Cys Leu Val Ala Thr Asp Phe 
3085 3090 3095 3100 

TAC AGA CAC CAG ATA GAG GAG GAG CTC GAC CGC AGG GCC TTC CAG TCT 9663 
Tyr Arg His Gin lie Glu Glu Glu Leu Asp Arq Arq Ala Phe Gin Ser 

3105 3110 3115 

CTG CTT GAG GTG CTT GCA GCC CCA GGA AGC CCA TAT CAC CGG CTG CTG 9711 
Val Leu Glu Val Val Ala Ala Pro Gly Ser Pro Tyr His Arq Leu Leu 

3120 3125 3130 

ACT TGT TTA CGA AAT GTC CAC AAG GTC ACC ACC TGC T CAGCGGCATG 9758 
Thr Cys Leu Arq Asn Val His Lys Val Thr Thr Cys 

3135 3140 
CTGGG/SGAGA CTGTGAGGCG GCAGCTGGGG CCGGAGCCTT TGGAAGTCTG TGCCCTTGTG 9818 
CCCTGCCTCC ACCGAGCCAG CTTGGTCCCT ATGGGCTTCC GCACATGCCG GGGGCGGCCA 9878 
GGCAACGTGC GTGTCTCTGC CATCTGGCAG AACTGCTCTT TGTGGCAGTG GCCAGGCAGG 9938 
CAGTCTCTGC AGTCCTGGTG GGGCTGAGCC TGACGCOTC CAGAAAGCAG GAGCAGCTGT 9998 
GCTGGVXCC ATGTGGGTGA CCAGGTCCTT TCTCCTGATA CTCACCTGCT GUI 1 01 1 GCC 10058 
AGGTTGCAGC TGCTCTTGCA TCTGGGCCAG AAGTCCTCCC TCCTGC/VGGC TGGCTGTTGG 10118 
GCCCTCTGCT GTCCTGCAGT AGAAGGTGOC GTGAGCAGGC TTTGGGAACA CTGGCCTGGG 10178 
TCTCCCTGGT GGGGTGTGCA TGCCACGCOC CGTGTCTGGA TGCACACATG 0CATGGCCTG 10238 
TGCTGGGCCA GTGGCTGGGG GTGCTAGACA CCCGGCACCA TTCTCCCTTC TCTCTTTTCT 10298 
TCTCAGGATT TAAAATTTAA TTATATCAGT AAACAGATTA ATTTTAACGT AAAAAAAAAA 10358 
AAAAAAAA 10366 

[0112] ie?ij#-^ : 6 * h#D5>- : flOKR 

B^iJcDfii* : 3 1 4 4 KflloaSI : * >'^ff 

Met Ala Thr Leu Glu Lys Leu Met Lys Ala Phe Glu Ser Leu Lys Ser 

15 10 15 

Phe Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 

20 25 30 

Cln Gin Gin Gin Gin Gin Gin Gin Pro Pro Pro Pro Pro Pro Pro Pro 

35 40 45 

Pro Pro Pro Gin Leu Pro Gin Pro Pro Pro Gin Ala Gin Pro Leu Leu 

50 55 60 

Pro Gin Pro Gin Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Gly Pro 
65 70 75 80 

Ala Val Ala Glu Glu Pro Leu His Arq Pro Lys Lys Glu Leu Ser Ala 

85 90 95 

Thr Lys Lys Asp Arq Val Asn His Cys Leu Thr He Cys Glu Asn He 

100 105 110 

Val Ala Gin Ser Val Arq Asn Ser Pro Glu Phe Gin Lys Leu Leu Gly 

115 120 12 5 

He Ala Met Glu Leu Fhe Leu Leu Cys Ser Asp Asp Ala Glu Ser Asp 
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63 64 
130 135 140 

Val Arq Met Val Ala Asp Clu Cys Leu Asn Lys Val He Lys Ala Leu 
145 150 155 . 160 

Met Asp Ser Asn Leu Pro Arq Leu Gin Leu Glu Leu Tyr Lys Glu He 

165 170 175 

Lys Lys Asn Gly Ala Pro Arq Ser Leu Arq Ala Ala Leu Trp Arq Phe 

180 185 190 

Ala Glu Leu Ala His Leu Val Arq Pro Gin Lys Cys Arq Pro Tyr Leu 

195 200 205 

Val Asn Leu Leu Pro Cys Leu Thr Arq Thr Ser Lys Arq Pro Glu Glu 

210 215 220 

Ser Val Gin Glu Thr Leu Ala Ala Ala Val Pro Lys He Met Ala Ser 
225 230 235 240 

Phe Gly Asn Phe Ala Asn Asp Asn Glu He Lys Val Leu Leu Lys Ala 

245 250 255 

Phe He Ala Asn Leu Lys Ser Ser Ser Pro Thr He Arq Arq Thr Ala 

260 265 270 

Ala Gly Ser Ala Val Ser He Cys Gin His Ser Arq Arq Thr Gin Tyr 

275 280 285 

Phe Tyr Ser Trp Leu Leu Asn Val Leu Leu Gly Leu Leu Val Pro Val 

290 295 300 

Glu Asp Glu His Ser "Thr Leu Leu He Leu Gly Val Leu Leu Thr Leu 
305 310 315 320 

Arq Tyr Leu Val Pro Leu Leu Gin Gin Gin Val Lys Asp Thr Ser Leu 

325 330 335 

Lys Gly Ser Phe Gly Val Thr Arq Lys Glu Met Glu Val Ser Pro Ser 

340 345 350 

Ala Glu Gin Leu Val Gin Val Tyr Glu Leu Thr Leu His His Thr Gin 

355 360 365 

His Gin Asp His Asn Val Val Thr Gly Ala Leu Glu Leu Leu Gin an 

370 375 380 

Leu Phe Arq Thr Pro Pro Pro Glu Leu Leu Gin Thr Leu Thr Ala Val 
385 390 395 400 

Gly Gly He Gly Gin Leu Thr Ala Ala Lys Glu Glu Ser Gly Gly Arq 

405 410 415 

Ser Arq Ser Gly Ser lie Val Glu Leu He Ala Gly Gly Gly Ser Ser 

420 425 430 

Cys Ser Pro Val Leu Ser Arq Lys Gin Lys Gly Lys Val Leu Leu ay 

435 440 445 

Clu Glu Glu Ala Leu C\u Asp Asp Ser Glu Ser Arq Ser Asp Val Ser 

450 455 460 

Ser Ser Ala Leu Thr Ala Ser Val Lys Asp Glu He Ser Gly Glu Leu 
465 470 475 480 

Ala Ala Ser Ser Gly Val Ser Thr Pro Gly Ser Ala Gly His Asp He 

485 490 495 

lie Thr Glu Gin Pro Arq Ser Gin His Thr Leu Gin Ala Asp Ser Leu 

500 505 510 

Asp Leu Ala Ser Cys Asp Leu Thr Ser Ser Ala Thr Asp Gly Asp Clu 

515 520 525 

Clu Asp He Leu Ser His Ser Ser Ser Gin Val Ser Ala Val Pro Ser 
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1 



66 



65 

530 535 540 

Asp Pro Ala Met Asp Leu Asn Asp Cly Thr Gin Ala Ser Ser Pro He 
545 550 555 560 

Ser Asp Ser Ser Gin Thr TTir Thr Glu Gly Pro Asp Ser Ala Val Thr 

565 570 575 

Pro Ser Asp Ser Ser Glu He Val Leu Asp Gly Thr Asp Asn Gin Tyr 

580 585 590 

Leu Gly Leu Gin He Gly Gin Pro Gin Asp Glu Asp Glu Glu Ala Thr 

595 600 605 

Gly lie Leu Pro Asp Glu Ala Ser Glu Ala Phe Arq Asn Ser Ser Met 

6U0 615 620 

Ala Leu Gin Gin Ala His Leu Leu Lys Asn Met Ser His Cys Arq an 
62 5 630 63 5 640 

Pro Ser Asp Ser Ser Val Asp Lys Phe Val Leu Arq Asp Glu Ala Thr 

645 650 655 

Glu Pro Gly Asp Gin Clu Asn Lys Pro Cys Arq lie Lys Gly Asp He 

660 665 670 

Gly Gin Ser Thr Asp Asp Asp Ser Ala Pro Leu Val His Ser Val Arq 

675 680 685 

Leu Leu Ser Ala Ser Fhe Leu Leu Thr Gly Gly Lys Asn Val Leu Val 

690 695 700 

Pro Asp Arq Asp Val Arq Val Ser Val Lys Ala Leu Ala Leu Ser Cys 
705 710 715 720 

Val Gly Ala Ala Val Ala Leu His Pro Glu Ser Phe Phe Ser Lys Leu 

72 5 730 735 

Tyr Lys Val Pro Leu Asp Thr Thr Glu Tyr Pro Glu Glu Gin Tyr Val 

740 745 750 

Ser Asp lie Leu Asn Tyr He Asp His Gly Asp Pro Gin Val Arq ay 

755 760 765 

Ala Thr Ala lie Leu Cys Gly Thr Leu He Cys Ser He Leu Ser Arq 

770 775 780 

Ser Arq Phe His Val ay Asp Trp Met Gly Thr He Arq Thr Leu Thr 
785 790 795 800 

ay Asn Thr Phe Ser Leu Ala Asp Cys He Pro Leu Leu Arq Lys Thr 

805 810 815 

Leu Lys Asp Glu Ser Ser Val Thr Cys Lys Leu Ala Cys Thr Ala Val 

820 825 830 

Arq Asn Cys Val Met Ser Leu Cys Ser Ser Ser Tyr Ser Glu Leu ay 

835 840 845 

Leu Gin Leu lie He Asp Val Leu Thr Leu Arq Asn Ser Ser Tyr Trp 

850 855 860 

Leu Val Arq Thr Glu Leu Leu Glu Thr Leu Ala Glu He Asp Phe Arq 
865 870 875 880 

Leu Val Ser Phe Leu Clu Ala Lys Ala Glu Asn Leu His Arq Gly Ala 

885 890 895 

His His Tyr Thr Gly Leu Leu Lys Leu Gin Glu Arq Val Leu Asn Asn 

900 905 910 

Val Val He His Leu Leu Gly Asp Glu Asp Pro Arq Val Arq His Val 

915 920 925 

Ala Ala Ala Ser Leu He Arq Leu Val Pro Lys Leu Phe Tyr Lys Cys 
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67 



68 



930 



935 



940 



Asp Cln Glv Gin Ala Asp Pro Val Val Ala Val Ala Arg Asp Gin Ser 
945 950 955 960 

Ser Val Tyr Leu Lys Leu Leu Met His Glu Thr Gin Pro Pro Ser His 

965 970 975 

Phe Ser Val Ser Thr lie Thr Arq He Tyr Arq Gly Tyr Asn Leu Leu 

980 985 990 

Pro Ser He Thr Asp Val Thr Met Glu Asn Asn Leu Ser Arq Val He 

995 1000 1005 

Ala Ala Val Ser His Glu Leu He Thr Ser Thr Thr Arq Ala Leu Thr 

1010 1015 1020 

Phe Gly Cys Cys Glu Ala Leu Cys Leu Leu Ser Thr Ala Phe Pro Val 
1025 1030 1035 1040 

Cys He Trp Ser Leu Gly Trp His Cys Gly Val Pro Pro Leu Ser Ala 

1045 1050 1055 

Ser Asp Glu Ser Arq Lys Ser Cys Thr Val Gly Met Ala Thr Met He 

1060 1065 1070 

Leu Thr Leu Leu Ser Ser Ala Trp Phe Pro Leu Asp Leu Ser Ala His 

1075 1080 1085 

Gin Asp Ala Leu He Leu Ala Gly Asn Leu Leu Ala Ala Ser Ala Pro 

1090 1095 1100 

Lys Ser Leu Arq Ser Ser Trp Ala Ser Glu Glu Glu Ala Asn Pro Ala 
H05 1110 1115 1120 

Ala Thr Lys Gin Glu Glu Val Trp Pro Ala Leu Gly Asp Arq Ala Leu 

1125 1130 1135 

Val Pro Met Val Glu Gin Leu Phe Ser His Leu Leu Lys Val He Asn 

1140 1145 1150 

lie Cys Ala His Val Leu Asp Asp Val Ala Pro Gly Pro Ala He Lys 

1155 1160 U65 

Ala Ala Leu Pro Ser Leu Thr Asn Pro Pro Ser Leu Ser Pro He Arq 

1170 1175 1180 

Arq Lys Gly Lys Glu Lys Glu Pro Gly Glu Gin Ala Ser Val Pro Leu 
1185 1190 1195 1200 

Ser Pro Lys Lys Gly Ser Glu Ala Ser Ala Ala Ser Arq Gin Ser Asp 



Thr Ser Gly Pro Val Thr Thr Ser Lys Ser Ser Ser Leu Gly Ser Phe 

1220 1225 1230 

Tyr His Leu Pro Ser Tyr Leu Arq Leu His Asp Val Leu Lys Ala Thr 

1235 1240 1245 

His Ala Asn Tyr Lys Val Thr Leu Asp Leu Gin Asn Ser Thr Glu Lys 

1250 1255 1260 

Phe Gly Gly Phe Leu Arq Ser Ala Leu Asp Val Leu Ser Gin He Leu 
1265 1270 1275 128< 

Glu Leu Ala Thr Leu Cln Asp He Gly Lys Cys Val Glu Glu He Leu 

1285 1290 1295 

Gly Tyr Leu Lys Ser Cys Phe Ser Arq Glu Pro Met Met Ala Thr Val 

1300 1305 1310 

Cys Val Gin Gin Leu Leu Lys Thr Leu Phe Gly Thr Asn Leu Ala Ser 

1315 1320 1325 

Cln Phe Asp Gly Leu Ser Ser Asn Pro Ser Lys Ser Gin Gly Arq Ala 



1205 



1210 



1215 
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69 



70 



1330 



1335 



1340 



Gin Arg Leu Gly Ser Ser Ser Val Arg Pro Gly Leu Tyr His Tyr Cys 
1345 1350 1355 1360 

Phe Met Ala Pro Tyr Thr His Phe Thr Gin Ala Leu Ala Asp Ala Ser 

1365 1370 1375 

Leu Arg Asn Met Val Qn Ala Glu Gin Glu Asn Asp Thr Ser Gly Trp 

1380 1385 1390 

Phe Asp Val Leu Gin Lys Val Ser Thr Gin Leu Lys Thr Asn Leu Thr 

1395 1400 1405 

Ser Val Thr Lys Asn Arg Ala Asp Lys Asn Ala lie His Asn His lie 

1410 1415 1420 

Arg Leu Phe Glu Pro Leu Val lie Lys Ala Leu Lys Gin Tyr Thr Thr 
1425 1430 1435 1440 

Thr Thr Cys Val Gin Leu Gin Lys Gin Val Leu Asp Leu Leu Ala Gin 

1445 1450 1455 

Leu Val Gin Leu Arg Val Asn Tyr Cys Leu Leu Asp Ser Asp Gin Val 

1460 1465 1470 

Phe He Gly Phe Val Leu Lys Gin Phe Glu Tyr He Glu Val Gly an 

1475 1480 1485 

Phe Arg Glu Ser Glu Ala lie lie Pro Asn He Phe Phe Phe Leu Val 

1490 1495 1500 

Leu Leu Ser Tyr Glu Arg Tyr His Ser Lys Gin He lie Gly lie Pro 
1505 1510 1515 1520 

Lys lie He Gin Leu Cys Asp Gly lie Met Ala Ser Gly Arq Lys Ala 

1525 1530 1535 

Val Thr His Ala lie Pro Ala Leu Gin Pro He Val His Asp Leu Phe 

1540 1545 1550 

Val Leu Arg Gly Thr Asn Lys Ala Asp Ala Gly Lys Glu Leu Glu Thr 

1555 1560 1565 

an Lys Glu Val Val Val Ser Met Leu Leu Arg Leu He Gin Tyr His 

1570 1575 1580 

an Val Leu Glu Met Phe He Leu Val Leu an Gin Cys His Lys Glu 
1585 1590 1595 160C 

Asn Glu Asp Lys Trp Lys Arg Leu Ser Arq Gin He Ala Asp He He 

1605 1610 1615 

Leu Pro Met Leu Ala Lys Gin Gin Met His lie Asp Ser His Glu Ala 

1620 1625 1630 

Leu Gly Val Leu Asn Thr Leu Phe Glu He Leu Ala Pro Ser Ser Leu 

1635 1640 1645 

Arg Pro Val Asp Met Leu Leu Arg Ser Met Phe Val Thr Pro Asn Thr 

1650 1655 1660 

Met Ala Ser Val Ser Thr Val Gin Leu Trp He Ser Gly He Leu Ala 
1665 1670 1675 168( 

He Leu Arq Val Leu He Ser Gin Ser Thr Glu Asp He Val Leu Ser 

1685 1690 1695 

Arg He Gin Glu Leu Ser Phe Ser Pro Tyr Leu He Ser Cys Thr Val 

1700 1705 1710 

He Asn Arg Leu Arg Asp Gly Asp Ser Thr Ser Thr Leu Glu Glu His 

1715 1720 1725 

Ser Glu Gly Lys Gin He Lys Asn Leu Pro Glu Glu Thr Phe Ser Arg 
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71 



72 



1730 



1735 



1740 



Rig Leu Leu Gin Leu Val Gly He Leu Leu Glu Asp He Val Thr Lys 
1745 1750 1755 176' 

Gin Leu Lys Val Glu Met Ser Glu Gin Gin His Thr Phe Tyr Cys an 

1765 1770 1775 

Glu Leu Gly Thr Leu Leu Met Cys Leu lie His lie Phe Lys Ser ay 

1780 1785 1790 

Met Phe Arq Arg He Thr Ala Ala Ala Thr Arg Leu Phe Arg Ser Asp 

1795 1800 1805 

ay Cys Gly Gly Ser Fhe Tyr Thr Leu Asp Ser Leu Asn Leu Arq Ala 

1810 1815 1820 

Arg Ser Met He Thr "Thr His Pro Ala Leu Val Leu Leu Trp Cys an 
1825 1830 1835 184< 

He Leu Leu Leu Val Asn His Thr Asp Tyr Arq Trp Trp Ala Glu Val 

1845 1850 1855 

an Gin Thr Pro Lys Arg His Ser Leu Ser Ser Thr Lys Leu Leu Ser 

1860 1865 1870 

Pro Gin Met Ser Gly Clu Glu Glu Asp Ser Asp Leu Ala Ala Lys Leu 

1875 1880 1885 

ay Met Cys Asn Arq Clu He Val Arq Arq Gly Ala Leu He Leu Phe 

1890 1895 1900 

Cys Asp Tyr Val Cys an Asn Leu His Asp Ser Glu His Leu Thr Trp 
1905 1910 1915 192( 

Leu He Val Asn His lie Gin Asp Leu He Ser Leu Ser His Glu Pro 

1925 1930 1935 

Pro Val Gin Asp Phe He Ser Ala Val His Arq Asn Ser Ala Ala Ser 

1940 1945 1950 

ay Leu Phe He Gin Ala He Gin Ser Arq Cys Glu Asn Leu Ser Thr 

1955 1960 1965 

Pro Thr Met Leu Lys Lys Thr Leu Gin Cys Leu Glu Gly He His Leu 

1970 1975 1980 

Ser Gin Ser Gly Ala Val Leu Thr Leu Tyr Val Asp Arq Leu Leu Cys 
1985 1990 1995 200C 

Thr Pro Phe Arg Val Leu Ala Arq Met Val Asp He Leu Ala Cys Arg 

2005 2010 2015 

Arq Val Glu Met Leu Leu Ala Ala Asn Leu Gin Ser Ser Met Ala an 

2020 2025 2030 

Leu Pro Met Glu Glu Leu Asn Arq He Gin Glu Tyr Leu Gin Ser Ser 

2035 2040 2045 

ay Leu Ala Gin Arq His Gin Arq Leu Tyr Ser Leu Leu Asp Arq Phe 

2050 2055 2060 

Arq Leu Ser Thr Met an Asp Ser Leu Ser Pro Ser Pro Pro Val Ser 
2065 2070 2075 208C 

Ser His Pro Leu Asp ay Asp Gly His Val Ser Leu Glu Thr Val Ser 

2085 2090 2095 

Pro Asp Lys Asp Trp Tyr Val His Leu Val Lys Ser Gin Cys Trp Thr 

2100 2105 2110 

Arq Ser Asp Ser Ala Leu Leu Glu Gly Ala Glu Leu Val Asn Arq He 

2115 2120 2125 

Pro Ala Glu Asp Met Asn Ala Phe Met Met Asn Ser Glu Phe Asn Leu 
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73 



74 



2130 



2135 



2140 



Ser Leu Leu Ala Pro Cys Leu Ser Leu Gly Ntet Ser Glu He Ser Gly 
2145 2150 2155 216< 

Gly Gin Lys Ser Ala Leu Phe Glu Ala Ala Arq Glu Val Thr Leu Ala 

2165 2170 2175 

Arq Val Ser Gly Thr Val Gin Gin Leu Pro Ala Val His His Val Phe 

2180 2185 2190 

Gin Pro Glu Leu Pro Ala Glu Pro Ala Ala Tyr Trp Ser Lys Leu Asn 

2195 2200 2205 

Asp Leu Phe Gly Asp Ala Ala Leu Tyr Gin Ser Leu Pro Thr Leu Ala 

2210 2215 2220 

Arq Ala Leu Ala Gin Tyr Leu Val Val Val Ser Lys Leu Pro Ser His 
2225 2230 2235 2244 

Leu His Leu Pro Pro Glu Lys Glu Lys Asp lie Val Lys Phe Val Val 

2245 2250 2255 

Ala Thr Leu Glu Ala Leu Ser Trp His Leu lie His Glu Gin He Pro 

2260 2265 2270 

Leu Ser Leu Asp Leu Gin Ala Gly Leu Asp Cys Cys Cys Leu Ala Leu 

2275 2280 2285 

Gin Leu Pro Gly Leu Trp Ser Val Val Ser Ser Thr Glu Phe Val Thr 

2290 2295 2300 

His Ala Cys Ser Leu lie Tyr Cys Val His Phe He Leu Glu Ala Val 
2305 2310 2315 232< 

Ala Val Gin Pro Gly Glu Gin Leu Leu Ser Pro Glu Arq Arq Thr Asn 

2325 2330 2335 

"Thr Pro Lys Ala He Ser Glu Glu Glu Glu Glu Val Asp Pro Asn Thr 

2340 2345 2350 

Gin Asn Pro Lys Tyr He Thr Ala Ala Cys Glu Met Val Ala Glu Met 

2355 2360 2365 

Val Glu Ser Leu Gin Ser Val Leu Ala Leu Gly His Lys Arq Asn Ser 

2370 2375 2380 

Gly Val Pro Ala Phe Leu Thr Pro Leu Leu Arq Asn He He He Ser 
2385 2390 2395 240< 

Leu Ala Arq Leu Pro Leu Val Asn Ser Tyr Thr Arq Val Pro Pro Leu 

2405 2410 2415 

Val Trp Lys Leu Gly Trp Ser Pro Lys Pro Gly Gly Asp Phe Gly Thr 

2420 2425 2430 

Ala Phe Pro Glu He Pro Val Glu Phe Leu Gin Glu Lys Glu Val Fhe 

2435 2440 2445 

Lys Glu Phe He Tyr Arq He Asn Thr Leu Gly Trp Thr Ser Arq Thr 

2450 2455 2460 

Gin Phe Glu Glu Thr Trp Ala Thr Leu Leu Gly Val Leu Val Thr Gin 
2465 2470 2475 248( 

Pro Leu Val Met Glu Gin Glu Glu Ser Pro Pro Glu Glu Asp Thr Glu 

2485 2490 2495 

Arq Thr Gin He Asn Val Leu Ala Val Gin Ala He Thr Ser Leu Val 

2500 . 2505 2510 

Leu Ser Ala Met Thr Val Pro Val Ala Gly Asn Pro Ala Val Ser Cys 

2515 2520 2525 

Leu Glu Gin Gin Pro Arq Asn Lys Pro Leu Lys Ala Leu Asp Thr Arq 
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75 



76 



2530 



2535 



2540 



Fhe Gly Arq Lys Leu Ser lie lie Arg Gly He Val Glu Gin Glu He 
2545 2550 2555 2560 

an Ala Met Val Ser Lys Arq Glu Asn lie Ala Thr His His Leu Tyr 

2565 2570 2575 

Gin Ala Trp Asp Pro Val Pro Ser Leu Ser Pro Ala Thr Thr Gly Ala 

2580 2585 2590 

Leu lie Ser His Glu Lys Leu Leu Leu Gin He Asn Pro Glu Arq Glu 

2595 2600 2605 

Leu Gly Ser Met Ser Tyr Lys Leu Gly Gin Val Ser He His Ser Val 

2610 2615 2620 

Trp Leu Gly Asn Ser He Thr Pro Leu Arq Glu Glu Glu Trp Asp Glu 
2625 2630 2635 2640 

Glu Glu Glu Glu Glu Ala Asp Ala Pro Ala Pro Ser Ser Pro Pro "Thr 



Ser Pro Val Asn Ser Arq Lys His Arq Ala Gly Val Asp He His Ser 

2660 2665 2670 

Cys Ser Gin Phe Leu Leu Glu Leu Tyr Ser Arq Trp He Leu Pro Ser 

2675 2680 2685 

Ser Ser Ala Arq Arq Thr Pro Ala lie Leu lie Ser Glu Val Val Arq 

2690 2695 2700 

Ser Leu Leu Val Val Ser Asp Leu Phe Thr Glu Arq Asn Gin Phe Glu 
2705 2710 2715 2721 

Leu Met Tyr Val Thr Leu Thr Glu Leu Arq Arq Val His Pro Ser Glu 

2725 2730 2735 

Asp Glu He Leu Ala Gin Tyr Leu Val Pro Ala Thr Cys Lys Ala Ala 

2740 2745 2750 

Ala Val Leu Gly Met Asp Lys Ala Val Ala Glu Pro Val Ser Arq Leu 

2755 2760 2765 

Leu Glu Ser Thr Leu Arq Ser Ser His Leu Pro Ser Arq Val Gly Ala 

2770 2775 2780 

Leu His Gly He Leu Tyr Val Leu Glu Cys Asp Leu Leu Asp Asp Thr 
2785 2790 2795 280( 

Ala Lys Gin Leu He Pro Val He Ser Asp Tyr Leu Leu Ser Asn Leu 

2805 2810 2815 

Lys Gly He Ala His Cys Val Asn He His Ser Gin Gin His Val Leu 

2820 2825 2830 

Val Met Cys Ala Thr Ala Phe Tyr Leu He Glu Asn Tyr Pro Leu Asp 

2835 2840 2845 

Val Gly Pro Glu Phe Ser Ala Ser He He Gin Met Cys Gly Val Met 

2850 2855 2860 

Leu Ser Gly Ser Glu Glu Ser Thr Pro Ser lie lie Tyr His Cys Ala 
2865 2870 2875 288C 

Leu Arq Gly Leu Glu Arq Leu Leu Leu Ser Glu Gin Leu Ser Arq Leu 

2885 2890 2895 

Asp Ala Glu Ser Leu Val Lys Leu Ser Val Asp Arq Val Asn Val His 

2900 2905 2910 

Ser Pro His Arq Ala Met Ala Ala Leu Gly Leu Met Leu Thr Cys Met 

2915 2920 2925 

Tyr Thr Gly Lys Glu Lys Val Ser Pro Gly Arq Thr Ser Asp Pro Asn 



2645 



2650 



2655 



(40) 

77 78 
2930 2935 , 2940 

Pro Ala Ala Pro Asp Ser Glu Ser Val He Val Ala Met Glu Arq Val 
2945 2950 2955 2960 

Ser Val Leu Phe Asp Arq He Arq Lys Gly Phe Pro Cys Glu Ala Arq 

2965 2970 2975 

Val Val Ala Arq He Leu Pro Gin Phe Leu Asp Asp Phe Phe Pro Pro 

2980 2985 2990 

Gin Asp He Met Asn Lys Val He Gly Glu Phe Leu Ser Asn Gin Gin 

2995 3000 3005 

Pro Tyr Pro Gin Phe Met Ala Thr Val Val Tyr Lys Val Phe Gin Thr 

3010 3015 3020 

Leu His Ser Thr Gly Gin Ser Ser Met Val Arq Asp Trp Val Met Leu 
3025 3030 3035 3040 

Ser Leu Ser Asn Phe Thr Gin Arq Ala Pro Val Ala Met Ala Thr Trp 

3045 3050 3055 

Ser Leu Ser Cys Phe Phe Val Ser Ala Ser Thr Ser Pro Trp Val Ala 

3060 3065 3070 

Ala He Leu Pro His Val He Ser Arq Met Gly Lys Leu Glu Gin Val 

3075 3080 3085 

Asp Val Asn Leu Phe Cys Leu Val Ala Thr Asp Phe Tyr Arq His Gin 

3090 3095 3100 

He Glu Glu Glu Leu Asp Arq Arq Ala Phe Gin Ser Val Leu Glu Val 
3105 3110 3115 3120 

Val Ala Ala Pro Gly Ser Pro Tyr His Arq Leu Leu Thr Cys Leu Arq 



45HPF7 -67 6 6 1 



3125 3130 
Asn Val His Lys Val Thr Thr Cys 

3140 

[01] h Dm$mm<D^mm<Dfflmwmx*&z 0 

[02] ^>^>^> ( I T 1 5) ^f*CDy-1f> 30 
[03] I T 1 5K?M«St^cDNA^O-> 

[04] ^>^>^> ( i t 1 5) <ommm(Dm& 

[0 5] ^>^>^> ( I T 1 5) <Dumm<Dtp& 
[0 6] ^>*>*> ( IT1 5) <Dumm<DW 

SP#-C$>£ e 40 
[0 7] (CAG)n Kffl©DNAKM#«©iB*£7S 

[0 8 ] ^^tt«:^r»o*©^tt*ffll*fc^* 
Xi^HD nmm<D (CAG)n KtS<D P C RM©M* 

[09] WBSSCC-3l>T#*«^©^*ffi 
l>/c^*Xx^HDI^)iS^CD(CAG)n Jg«<DPCR# 

[010] £g#H Dmtmicoi>r**:m&<D®ft 

%mi<>tcTJ U XtfcCD— mtctoVZ>(C AG)n JSftCDP 50 



3135 



[012] HDzsizmcrtmzhtLzmfctemm 

[013] *tM^fett^HD^fe»±OCCAG)n 
Sf8#fl[©»©tfc«S:^r if 9 7 V * & 0 

[014] mm&tf&m&%fcmti2>HDm&#±. 
[015] «aRocje»3^6easn*HD»aft± 

CD(CAG)n JS«^{4^<Dtb$5^"r^^^r$>^ 0 
[016] 3o<D^&:7t ^ y-ft*©HDSft&f* 
±cr>(CAG)n ^*tt<D»*S5(DHD»««iJt«L 

[017] 3o(Dg^7 7 5 «;-ft*0HDSfefef* 
±<7>CCAG)n E&mm>&*%WHD9fflmttmi, 

[018] 3-5©g^ft7y 5 y-ft*OHDSfeef* 
±<D(CAG)n S«*Cft<D»*8ij<DHD»^ilteL/ 



(41) 



?#§3¥7 - 6 7 6 6 1 



79 



[019] WSI&UWJfrr*T«©(CAG)n 
[02 0 ] ffim&.V*tfo?Z>*1&<D<;CAG)n JgfflS 
1] ffi- : F-S^';>^ , ^DNAS*(DHD (C * 



80 



[02 2 J »fi^£Sa*(ft©fi3 4©IW««:m-r 



[01 ] 



[02] 



z> 

s 
t 
o 



4 

a 
i 
a 



DD O 
4 4 4 
BBS 
19 1 
2 6 6 

r 2 



600 kb 



I II! 



o 

4 
3 
9 
8 

I 



HI? ¥ ? ■ 

Ml rftfMlN MM 

LLil 




Ltca 



LW2C5 



IT15 



JT15 



IT11 IT10O3 ADDA 



[03] 



12 3 




— 28S 



Alt 




[07] 

AG CT 



3 

AGCT 



IT16C 
IT16A 
1T16B 
IT15B 
1T15A 



An 



r. 



IE*3X«K iE«cONA 



is 



(CAG) 



(CAG} 4 » 



(42) 



[H34 ] 



^¥7 -67 6 6 1 



tZI CC06QCCttCeCTCCCCCCecC«OCTCTttCAC0CjA^^ TC 

»1 ATK«CUiUftt<CTftASCBSOCCC&CCAST Ca C<^^ 

1 .HAT LffttftKAftC lie 

H ft f ft a 0 .0 ft ft.; ft f .0 Aft ««Qft qt ««4«<l • * * * T * * f * P F ft « L ♦ « 

A§1 CCSCCCCCCa iO CiaiCWCC CCT ccrcofr f « rnr»- < r rrr . rrn iW r <* n i 1 1 1 i iQCCOeCAftCOC rmxx*. TCTMneTt*£X^rrmrTAr*r*s*j^~.* .-..*+ mm ~^-. 

» m f.Q a »r i t r a ft • rrrr^rr-rffcp a v a c a * t itTnTti r 

T$ 1 jr.i I I M C < < V « t t 1 A A A V t ( ) I A i r t I U I M e u v tic 

XMA f * A(jl M < I. ft t 1 A A ft 1 A V 1 I C ft ft ft ft ft I « T f T I W Lift 

«2J ftTCOCTTAftcniAfranic^ 

296 V L t 6 iT% V f V C ft * ,* ft 1 ^ L 11 I V- I A I 1 I .T t ft ft L L • ft o iTlTl 

tgi fciwAftgycTf y 

Uji Accg agccp cgtotn^ cwitboc 

*1* 1 I A l f I, L fl • I f I T. f t ft ft 1 £ lit? A.V ft 6 1 ft ft t ft Aft ft C f ft ft ft 

W1 Ce AAO TO Wft6<yAOgrftCMCTATAftCrftftA^ ICC 1 1 K tAGM J+ Ct M < < P ftCMflCTBCICt TAPQAfll A ftMC AftBC CT TftAAPCftt 

. ** • • «.. « » j v « € i a ft ft • ft s c « r v c « a k a ft ft k v u ft ■ c t rn • 

ft^CrAMT<^|C<^CTC*cc^CTft^ 

V*. • « t ft ft ft ft ft ft ft ft A L f A * ft ft 0 ft I ft ft ft L A A ft ft ft ft ft ¥ P ft ft A ft ft ft 

tgj pyocAjtyeftftajM ^ ^ 




S» f ft ft ft ft ft ft. I .v v ft. ■ | ft ft ft ft ft ft I ft. If«f IICMCATII1MIA 

■si rryyyrft^^ 

CTCWAAigtMftOTftA CACCftjE^^ 
714 L T ft V ft ft ft ft t .t T ft* E fj ft T ft ft ft I L I T t ft I U M V I U I A I L C O 

m n ^jt t i i .1 i i i f i v • i « n't r i i i i t ftHTftftiAO ei r t a a c 

ftTAl ACVfCAAftAATtitaTCCTafT^^ 

r 4.ftft«fttVftCftlA CTAft.ftllCftft«ftCft»»T«E L t L ft ft I I ft V 

Wft A « ft T T C L-.l ft t ft* «.,A VlAlVVSftltOOCftftftVl HVAAAftftlft 

3 J21 pWTODCAA^AftqqTnATAAAW 
V36LV P ft ft .ft tftCftOCftAD PftVAVAtl«**VT l* L l K V I 1 • ^ f I 

£*^i^ C ^^« A «W*«CAgMT4TATAfiA6^ATMCCT^A0C^^ 
ftKH f ft V * T 1 T «.tTftf€THLl*ftf T D ft T ftl £ K ft L ft K V I A A ft ft ft E t 

016 I 7 S T 1 l A I Tr6CCCALClt*Tftr»VC| VAtCWiCftft^ ft « 



(43) - 67 6 6 1 



t * i «e¥TY¥^*W*2^ 



r» - ■ « .« * * i t . . . . ■^■r^ T "^^^^„....^ 



4$l§¥7 -67 6 6 1 



ZZt* C. T Q * I f 1 t A ft A L Aft t I VVV« KCfSKlftl 9 t t t C ft 0 J V «* V 

m* v*» t c a 1 y m 1 ■ « f « t ft I ft ft. » <. 4 * c l ■ c«ccac Tt f rTIT^ 

»§4 * v c $ r t ivt ft a c « l t rc ¥tpi t TTir A¥ cr a Bttt r I f e ■ 1 I T 




»4«6 ft ft J « I * .f t A ¥ *.A I I ft I V I ft A. ft) it V V A 4 iTTT«« i I 

K * L K A if 1 ft f C « ftt 1 ft I t ft .ft ft V C « C I CAMVSCftCft I ATI • t 



f«1 tATCAaoCATftOttATCCfftTfttCnCTCTftTCTCOMCTACTAOAftfltft Ctcn At- - _ ._ . . _._ . _ . . . „ 

T»AV»t*r« t ft r A t t 6 A L IIICKllLI t ft f € ft c iTTSTlT 



fiB ■ - •iTrn—rnT' tr rrr ir rn ia n ftftAijiJn 

2666 t iM.riu ji *. a< y.ft t K *> e ft « f v t c i r | t it iTr t « iTmr 

A I L | ft C V ¥ ft ft ft. C V ¥ ft 



« *. ¥ rAYCC A AA YftCMftft; A VA.CftVftftLLttTC** 



«.»*»♦ t V « l • ■ UlMMtllllMCIT T « ■ c c v 



»tt w v r r v # q t l * • r.t * « ii v. « i «t % nTi 



«04t TftCrceetAfiCC Ca MCBO CAf CCCCCOC aCOOCO BBCA ftftftCftpftl l CI C I L*A* TgfOCC*UU^CgTCmCT6«^ 

*f<\ ^CTftlftCCTaACaeCTt CCftCAAW i qi^ t I Ct U. I CAt^C<mfteTCmftTTQW«nCC»BCTftCTCnCCATC 

TOpet T MetcAft AAftTCtTceeicg€ieiu^<»cTO 

wtot *y«»«rTTHrm wcT«<iAT«c»fc^t«c^ n i u a 1 1 u t n il nm nwmmwwmwir f 

<«31 ATATCAftTAAAfiA^TIAftTTTWUlCftTAOAdUWUUUtfMUWAitAft 



*$Ki¥7 - 6 7 6 6 1 



[B8] [010] 




t$&3¥7 -67 66 1 



[011] 



i. 0 L r0 z 



m. 

1,2 1.3* 3?4 5.6 3^5 4.5 




[012] 



n 



jzh -0 



f,Z 3,4 6,6 itS 2,6 

# • • 




[013] 



[02 1 ] 



HH Sfefett 



□ HD Sfg* 




1 2 3 4 5 

i 1 r 1 i 1 i 1 r~ — ~i 

SLSL SLSLSL 




(47) 



tS§3¥7 -67 66 1 



[014] 




[HI 5 3 




0 20 40 60 80 100 



(48) 



7 - 67 6 6 1 





(49) 



<tmW-7 - 6 7 6 6 1 




19] 



-N 
Jf5 



90 



80 



70 



60 - 



60 



40 



30 




30 36 40 46 



• * — 



J L 



60 66 60 



65 70 76 80 85 90 



(50) 



f#BfJ¥7 -67 66 1 



[020] 





90 




B5 




80 




75 




70 






sas 


65 




60 




55 




50 




45 




40 




35 




30 



o 



"o °6 
oo o 




J L 



J J_ 



. -v-' 



30 35 40 45 60 55 



60 



66 



70 75 80 85 90 



[022] 




(51) Int. CI . 6 

C 0 7 K 16/18 
C 1 2 P 21/02 
21/08 

C 1 2 Q 1/68 
G 0 1 N 33/50 



C 9282- 4B 
9161- 4B 
A 9453- 4B 
T 



9SSB¥7-67 6 6 1 



42# :J->^-3105 



24# 

7# 



Searching PAJ 



1/2 ^— v 



PATENT ABSTRACTS OF JAPAN 



(1 1 publication number : 07-067661 
(43)Date of publication of application : 14.03.1995 



(51)Int.CI. 



C12N 
A61K 
C07K 
C07K 
C12P 
C12P 
C12Q 
G01N 



15/09 
38/00 
14/47 
16/18 
21/02 
21/08 
1/68 
33/50 



(21 Application number 
(22)Date of filing : 



06-036026 
07.03.1994 



(71) Applicant 

(72) Inventor : 



GENERAL HOSPITAL CORP:THE 

MACDONALD MARCY E 
AMBROSE CHRISTINE M 
DUYAO MABEL P 
GUSELLA JAMES F 



(30)Priority 
Priority number 



93 27498 
93 85000 



Priority date : 05.03.1993 

01.07.1993 



Priority country : US 



US 



(54) HUNTINGTIN DNA, ITS PROTEIN AND USE THEREOF 

(57)Abstract: 

PURPOSE: To obtain a new nucleic acid useful for gene therapy of Huntington's chorea or the 
like and for providing a recombinant DNA technique for diagnosis and treatment of Huntington's 
chorea, 

CONSTITUTION: This isolated nucleic acid contains a nucleic acid encoding a huntingtin protein 
and is a gene spanning about 210 kb and encoding a protein having about 348 kDa. The gene is 
derived from the adjoining part of 500 kb segment between chromosome marker D4S180 and 
D4S182, is isolated from a cosmid contigs of the candidate region by a cloned trapped exon. A 
(CAG)n trinucleotide repeat varying from 37 to at least 86 copies is amplified by regulation, 
localization, stability or translatability of the mRNA. 



LEGAL STATUS 

[Date of request for examination] 

[Date of sending the examiners decision of 
rejection] 

[Kind of final disposal of application other than 
the examiner's decision of rejection or 
application converted registration] 

[Date of final disposal for application] 

[Patent number] 

[Date of registration] 



07.03.2001 
04.11.2003 



http://www19.ipdl.ncipi.gojp/PA1/result/detail/main/wAAAxOaiZaDA407067661P... 2005/10/25 



Searching PAJ 2/2 "C— £/ 

[Number of appeal against examiners decision 
of rejection] 

[Date of requesting appeal against examiner's 
decision of rejection] 

[Date of extinction of right] 

Copyright (C); 1998,2003 Japan Patent Office 



http://www19.ipdl.ncipi.gojp/PA1/result/detail/main/wAAAxOaiZaDA407067661P... 2005/10/25 



m PAGE BIWIC ^sptg) 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
-QTfADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 



LJ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




